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Some Theorems on Properties of DC 
Equations of Nonlinear Networks 


By I. W. SANDBERG and A. N. WILLSON, JR. 
(Manuscript received July 3, 1968) 


Several results are presented concerning the equation F(x) + Ax = B 
(with F(-) a “diagonal” nonlinear mapping of real Euclidean n-space E” 
into itself, and A a real n X n matrix) which plays a central role in the 
de analysis of transistor networks. In particular, we give necessary and 
sufficient conditions on A such that the equation possesses a unique solution 
x for each real n-vector B and each strictly monotone increasing F(-) that 
maps E” onto ttself. 

There are several direct circutt-theoretic implications of the results. For 
example, we show that if the short-circuit admittance matrix G of the linear 
portion of the dc model of a transistor network satisfies a certain dominance 
condition, then the network cannot be bistable. Therefore, a fundamental 
restriction on the G matrix of an interesting class of switching circuits 1s 
that at must violate the dominance condition. 


I. INTRODUCTION 


For each positive integer 7 let 5” denote that collection of mappings 
of the real n-dimensional Euclidean space E” onto itself, defined by: 
F eS” if and only if there exist, forz = 1, --- , n, strictly monotone 
increasing functions f; mapping E’ onto E’ such that, for each ¢ = 
Ce a)" eH", F(z) = (f(a), «+ fn(@n))’ 

The main purpose of this paper is to report on some results concerning 
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properties of the equation 
F(x) + Ax = B, (1) 


where A is an n X n matrix of real numbers, F maps Z” into E”, and 
BE”. In particular, a condition to be satisfied by A is given which is 
both necessary and sufficient to guarantee that for each F 2 5” and each 
B « EK” there exists a unique solution of equation (1). 

We also study the problem of obtaining bounds on the solution of 
equation (1). These bounds show that (if F ¢« S" and our condition on A 
is satisfied) the solution depends continuously on B. The bounds are 
often of use in computing the solution by standard iteration methods 
such as the Newton-Raphson method. By appealing to a theorem of 
R. 8. Palais it is shown that the bounds can also be used to obtain a 
theorem essentially the same as, but somewhat weaker than, our 
principal result. 

Several results can be found in the literature which specify sufficient 
conditions for the existence of a unique solution of equation (1). For 
example, if A is positive semidefinite then a special case of a theorem 
of Ref. 1 guarantees the existence of a unique solution of equation (1) 
for all those F ¢ S" which have the property that the slope of each f; 
is bounded from above and below by positive constants, and for all 
B < i". This theorem also specifies that a certain iteration scheme will 
always converge to the solution. 

A theorem of G. J. Minty’, when applied to equation (1), also implies 
essentially the same result. The boundedness condition on the slopes 
of the functions f; is not required by Minty’s theorem. On the other 
hand, Minty’s theorem does not provide a procedure for computing the 
solution of equation (1). 

In Ref. 3 it is proved that a sufficient condition for the existence and 
uniqueness of a solution of equation (1) for all F « S” and Be EK" is 
that A satisfy a weak row-sum dominance condition: 


a;; 2 > Law| P= Ay te 
77 


Other information concerning the location and the computation of 
the solution is also given in Ref. 3. 

The class of matrices satisfying the condition of our theorem (which 
is defined in Section III and denoted by P,) includes all positive semi- 
definite matrices as well as all matrices which satisfy any one of several 

* Appendix A contains a simpler proof of a similar result and a proof of a new 


related result. These results specify convergent algorithms for obtaining the 
solution. 
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dominance conditions. Many other matrices are included in P, ; and 
since the condition of our theorem is both a necessary and a sufficient 
one, we are assured that P, is the largest class of matrices A for which 
equation (1) has a unique solution for all F e S$" and all Be EL”. 


II. NONLINEAR NETWORKS 


Equation (1) is often encountered in the study of nonlinear electrical 
networks. In the case of networks containing only resistors (that is, 
linear resistors with nonnegative resistance), dependent and inde- 
pendent sources, and two-terminal nonlinear resistors that are described 
by functions in &* (diodes, for example), this is rather obvious.” Even 
for networks which contain more general nonlinear devices, however, 
equation (1) can often provide a convenient characterization. For 
example, D. A. Calahan shows in his recent book that the transistor 
network of Fig. 1 may be described by the equation 


en = 4 + 0.0225 oe V, 0.00177V.. 
I,,(e""*? — 1) —0.168 0.4941 |V, —0.188V,, 


if the Ebers—Moll model is used to represent the transistor. (See pp. 
13ff of Ref. 4.) In this equation J,,, I.,, q, k, T, and V,, all represent 
fixed real parameters. It is quite trivial to apply the theory of this paper 
(in particular, Corollary 3 of Section IV) to Calahan’s example and 
prove that this equation has a solution, the solution is unique, and 
the solution depends continuously on V,,. We also show how bounds 
on the solution can be obtained. 

















Fig. 1— Biased transistor-stage, 
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More generally, it is frequently the case that networks which contain 
transistors, as well as the previously mentioned linear and nonlinear 
elements, may be described by the equation 


TF(x) + Ax = B. (2) 


In this case, x is a vector whose components are the voltages across 
the nonlinear resistors and the transistor base-emitter and base-collector 
voltages. The n X n matrix A is the y-parameter matrix of the linear 
n-port network which is obtained by removing all nonlinear resistors 
and transistors and setting the value of each independent source to 
zero. The function 7'F (x) describes the behavior of the nonlinear re- 
sistors and the transistors. It happens that the matrix TJ is nonsingular; 
therefore equation (2) can be put into the form of equation (1). 

Networks which contains inductors and capacitors as well as the 
memoryless elements already mentioned are of course described by dif- 
ferential equations. Even the study of such networks, however, can 
often lead to the consideration of equations of the same type as equa- 
tion (1). One usually finds the solution of such an equation is necessary, 
for example, when computing the solution of the differential equations 
by using some implicit numerical integration formula. 

The problem of determining the equilibrium states of the above- 
mentioned dynamic networks is one in which the consideration of 
equations of type (1) often arises in perhaps a more direct manner. 
In this regard, if it happens that equation (1) has a unique solution, 
then the network cannot possibly be bistable. 

When the determination of equilibrium states of a transistor net- 
work leads first to the consideration of equation (2), then as a rather 
direct application of our existence and uniqueness theorem it follows 
that if the matrix A satisfies a weak column-sum dominance condition, 

a;; 2 dD | ais |, ee Oe ae Ly 

iwi 

then 7'~*A e« P, and hence the network has exactly one equilibrium 
state. This result and related results which are proved in Section IV 
have the following interesting corollary: One cannot synthesize a 
bistable network which consists of resistors, inductors, capacitors, 
diodes, independent voltage and current sources, and one (Ebers—Moll 
modeled) transistor—or even an arbitrary number of (Ebers—Moll 
modeled) transistors with a common base connection. 

The authors feel that in many respects the main contributions of this 
paper are in the techniques used to prove the results. For this reason, we 
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have not chosen to summarize all of the results at the outset and relegate 
proofs to later sections. But rather, the results and the proofs will appear 
in the order in which they will best illustrate the techniques developed. 


II]. MATRICES OF CLASSES P AND Po 


The following notation will be used throughout the remainder of the 
paper: The origin in E” will be denoted by 0. If D is a diagonal matrix 
then D > 0 (D = 0) means that each element of D on the main diagonal 
is positive (nonnegative). 

In Ref. 5 and Ref. 6 M. Fiedler and V. Ptak define the classes of 
matrices denoted by P and P, . They in fact prove that the following 
properties of a square matrix A are equivalent: 


(z) All principal minors of A are positive. 
(iz) For each vector x ¥ 6 there exists an index k such that 2,y, > 0 
where y = Ax. 
(cit) For each vector x + 6 there exists a diagonal matrix D, > 0 such 
that the scalar product (Az, D,x) > 0. 
(iv) For each vector x ~ @ there exists a diagonal matrix H, = 0 such 
that (Aw, H,x) > 0. 
(v) Every real eigenvalue of A, as well as of each principal submatrix 
of A, is positive. 
The class of all matrices satisfying one of the above conditions is de- 
noted by P. Fiedler and Ptdk prove that the following properties of a 
square matrix A are also equivalent: 


(z) All principal minors of A are nonnegative. 
(27) For each vector x 6 there exists an index k such that x, 4 0 and 
LY, = Owherey = Ax. 
(iz) For each vector x ¥ 6 there exists a diagonal matrix D, = O such 
that (7, D,z) > 0 and (Az, D,x) = 0. 
(zv) Every real eigenvalue of A, as well as of each principal submatrix 
of A, is nonnegative. 


The class of all matrices satisfying one of the above conditions is de- 
noted by Po. 


The following theorems follow directly from the above definitions. 


Theorem 1. If A ¢ Py then for every diagonal matrix A = 0 (A > 0), 
A+AeP,(A+AeP). 

Proof: Let x ¥ 6. Then, since A ¢ P, , there exists an index k such 
that x, ~ 0 and x,(Az), = 0. Thus, xz,(Avx + Az), 2 0(>0). O 
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In particular, Theorem 1 implies that if A « P, and A 2 0 (A > 0) 
then det (A + A) 2 0 (>0). 


Theorem 2. If Ae P then A ¢ P. 


Proof: Suppose A e P. Let x ¥ 6 be given and let y = A™'x. y ¥ 0 
since A~’ is nonsingular. Thus, there exists a diagonal matrix D > 0 
such that (Ay, Dy) > 0, which implies (x, DA~*x) > 0, or (Dz, A~*x)>0, 
or (A~'x, Dx) > 0. That is, for every x ~ 6 there exists D > 0 such that 
(A™'x, Dz) > 0. Hence Ae P. O 

Because of the similarity of the definitions of the classes of matrices 
P and P, , one might conjecture that this proposition is also true: If 
Ast P,, and det A ¥ 0, then A~’ « P, . This conjecture is in fact true. 
Interestingly enough, however, its proof is not obtained as one might at 
first suspect, by simply modifying the proof of Theorem 2. Moreover, the 
proof of this conjecture does not even seem to follow directly from any 
of the above definitions of P, . Rather, upon making the trivial observa- 
tion that for every diagonal matrix D > 0, det (A~’ + D) = det (A7’) 
-det (D™* + A)-det (D), the conjecture is easily seen to follow from the 
fact that det (D + A) # 0 for every diagonal D > 0 if and only if 
As P,. This fact is a direct corollary to the proof of Theorem 3. 


IV. EXISTENCE AND UNIQUENESS THEOREM 


The following theorem is the principal result of this paper. 


Theorem 8. If Aisann X n matrix then there exists a unique solution 
of equation (1) for each F ¢ S" and for each B ¢ E” if and only if Ae Py. 

Proof: (if) Let Ae Py, F eS", and Be E”. The solution of equation 
(1) is then unique (if it exists) since if x and y are both solutions then, 
using the strict monotonicity property of F, there exists a diagonal 
matrix D > 0 such that F(x) — F(y) = D(« — y). But{[D + A](@ — y)= 
6 and, by Theorem 1, D + A is nonsingular. This means that x = y. 

We prove the existence of a solution of equation (1) by induction. For 
k =1,---,7, let 


fi(a1) Qi, °° * Aye b, 
F (2) = } A, — * * * ) B, cams 
fu(Xx) Any °° * Axx b, 


Clearly, A, e Py) , F, © F", and B, e EL”. Also, it is clear that there exists a 
unique solution of F(x) + A, = B, for each F, e §’ and for each B, « E’, 
and that this solution is a continuous function of }, . 
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Assume that there exists a unique solution of F,(«)-+A,v=B, for each 
F, ¢ 5, B, ¢ E", and that this solution depends continuously on any 
scalar parameter 7 upon which B, depends continuously. Let the 
matrices A; ,4, and A,.,,, be defined by 


Qi k+1 
Annu = . ’ 
Ok k+1 
Asie = [Qes1.°°° Ax+1,4J- 


Then, for every real number x;,, , the equation 
F(x) + Ayr + Ay pati = B, (3) 


has a (unique) solution which is a continuous function of x,,, and of 7. 
Let the components of this solution be denoted by x; = m; (x41, 7), 
fort = 1, --: , k, and define the vector M,(x41, 7) by M, = (m,°°--; 
My)’. 
We now prove that the function 
P(Li+1 +1) = Agar,cM (Casa » 0) + Aner eei€ear — bi1(m) 


is monotone increasing in 2,4, : Let zi,, , 27,, ¢ EH’ with a4, < Ua, - 
Then, if 17* (M’) denotes the solution of equation (8) when 2,44, = 
Tier (Trai), we have 


FM’) — F(M*) + AAM? — MY) + Agasi(tier — Civ) = 6. 


Because of the strict monotonicity of the function Ff, , however, there 
exists a k X k diagonal matrix A > 0 such that 


F,(M?) — F,(M°) = a(M? — M?). 


Hence, 
M’* — M* = [A + Ag| Agee as _ ta) 
Thus, 


o(ti+) = p(Li+1) = {Qnei,e+1 = Axzsi,xLA zi A;| Agel Gien = Tiwi): 


But then, from the easily verified relation 
0 
det A ae Ager 


0---0 
det (A + A,) 





Qe+ey,k+1 Arete st Ay] "Ag esi = 
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and since 


0 
det (A + A,) > 0, det /N\: + A.s,| = 0 (Theorem 1), 
0---0 } 
and 224; — Xa, > 0, it follows that o(2?,,) = ¢(xh,,). 


Now since ¢ is monotone increasing and, obviously, continuous in 
Lz+1 1b follows that the left side of the equation 


fri (Ve+1) oe P(Ln+1) = 0 (4) 


is a strictly monotone increasing function mapping Z’ onto E’, and hence 
equation (4) has a unique solution. If z?,, denotes this solution then 


mi(Xb ») 


mMi(Xi+1) 
esi 
is the (unique) solution of 


Pyar) + Assit = Busi. 


We must now prove that this solution is a continuous function of any 
scalar parameter 7 upon which B,,, depends continuously. It suffices to 
prove that x,,;, depends continuously on yn (see equation (3)). This may 
be done as follows: 


Let x24, be the solution of equation (4) corresponding to 7 = 7”. 
That is, let 


fris(@aa) a o(tes1 ) 1) = 0, 


and let e > 0 be given. Since f,,; is a strictly monotone increasing map- 
ping of EZ’ onto EH’, so is fz3, , and hence f;2, is continuous. Hence, there 
exists 6’ > 0 such that if | fr1i(@24.) — fear(tea1) | < & then | 2?,, — 
Tea. | < €. Since ¢ is a continuous function of 7, there exists 6 > 0 such 
that | 7° — 1 | < éimplies | o(2?,, , 7°) — o(22,,,7) | < 6. If| n° -nI|< 
6, and 


fas (irr) + O(te41, 7) = 0, 
then, 
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fev @evs) — fess@rr) + g(r >) — Gee.» 1) 
= —[e(tisr , 0°) — o@evr , 2]. 
But since both fi+1 and ¢ are monotone increasing in X41 , 
(wear — Cesi)[fesi(@esr) — ferr1@ess)] = O, 
and 
(ter — Ceer)[etier » 1) — Ges, 2] = 0. 
Therefore, 
| (evr = pei) teen Coe) — fars(te+r)] | 
S | (char — tes) [O(tier , 0°) — olaier , 2] |. 
Now, if 22,, = Yi. then of course | x24, — 2i+1| < €. Otherwise, 
| fasr(aear) — frsrQtesr) | S | o@reer , 0") — o@inr, 2) |. 
But then, 
| fesr(@har) — fers@arr) | < 8, 
and hence | x74, — 2i+1| < ¢. Thus, 24, is a continuous function of 7. 


(only tf) Suppose A ¢ P,. If det A < 0 then for sufficiently small 
¢ > 0, det (J + A) < 0. For sufficiently large ¢, however, 


det (¢{1 + A) = ¢"-det (1 + : 4) > 0. 
Thus, since det (¢J + A) is a continuous function of ¢, there is some 
value of ¢ > 0 such that det (¢J + A) = 0. For this value of ¢ let 
F(x) = ¢Ix. Clearly, for this choice of F eS", equation (1) cannot have a 
unique solution. 

If det A = 0, but A has a negative principal minor, we can still find a 
diagonal matrix A > 0 such that det (A + A) = 0; however, in this case 
A will not, in general, be simply the identity matrix multiplied by a 
positive constant ¢. . 

For some positive integer k < n let A havea k X k principal minor 
which is negative and let 


A™ = diag [6,,°-+ , dal. 


Since the determinant of A + A is not altered if any two rows and then 
the corresponding pair of columns are interchanged we may, without 
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loss of generality, assume that the matrix A is partitioned as 


A; Ag 
where A, isak X k matrix with det A, < 0. Let & > 0 be chosen so small 


that det (7 + A,) < 0, and let 6; = --- = 6, = & Now, if 6:4; = 
“++ = 6, = € > O, then 


det (A + A) = det ha me As 


A; GP As 
tl + A, A» 
= ¢"*. det 
1 1 . 
es A3 I “e A, 
gG = ¢ 


Thus, for ¢ > 0 chosen to be sufficiently large, det (A + A) < 0. 
(det (AY + A) > ¢”*-det (EJ + A) < 0 as €& > ~.) Now, if for 
n > 0, A® = ql, then it is clear that for 7 chosen sufficiently large, 
det [A Ay = a det? Ay S 0 Thus & 


A(e) = «AM + (1 — 2A, 


it is clear, since det [A(O) + A] > 0 and det [A(1) + A] < 0 and since 
det [A(e) + A] is a continuous function on 0 S ¢e S 1, that there is a 
value of e > 0 (0 < € < 1) such that det [A(e) + A] = 0. For this value 
of «, A(e) > O is the required diagonal matrix. © 

Notice that our proof shows that if F « 5" and A « P, , then the solu- 
tion of equation (1) depends continuously on any scalar parameter 
upon which B depends continuously. The arguments of Section V show, 
under these assumptions on F and A, that the operator (F + A)~°* is 
in fact a continuous map of £” into itself. 

In the proof of Theorem 3 we see that the uniqueness of the solution 
follows simply from the hypotheses that each f; is strictly monotone 
increasing and that A « P, . The additional hypotheses that each f; 
is continuous and maps E’ onto E* are not necessary (continuity of 
each f; is not explicitly hypothesized, but follows from the ‘“monoto- 
nicity”’ and ‘‘onto” hypotheses). Hence, we have: 


Corollary 1. If, fori = 1, --- ,n, S; 7s a subset of E’, and if S = 8, X 
“++ & 8, , and if F(x) = (fi(x:), --- , fa(v,))', where each f; maps E"* 
into E’ and is strictly monotone increasing on S; , thenif Ae Ppand Be E”, 
there exists at most one solution of equation (1) in S. 
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We now prove another interesting corollary of Theorem 3. We first 
define some additional notation. 

For each positive integer n let 8” denote the collection of all subsets of 
E" defined by: S « 8" if and only if S = S' X --- X S" where, for7 = 
1, --- , n, S' C EH’ and S’ has the same cardinality as Z*. For each 
S C 8" we define the collection 5"(S) of functions mapping S onto &” by: 
F 2 5"(S) if and only if there exist, for? = 1, --- , n, strictly monotone 
increasing functions f; mapping S’ onto E* such that for each x e S”, 


F(x) = (f:(@1); a lata) 


Corollary 2. If A is ann X n matrix and the collection 5"(S) 7s non- 
empty then there exists a unique solution of the equation 


F,(z) + AF,(z) = B (5) 


for each F, « S"(S), F, ¢ F"(S), and each B « K” if and only if Ae Py . 

Proof: Since Ff, ¢ §"(S), Fy’ : E” > S exists and Ff, o FZ « 5”. Thus, 
there exists a unique solution of equation (5) if and only if there exists 
a unique solution of 


FE "Y)) + 4y= 8B. O 


As special cases of Corollary 2 we have: there exists a unique solution 
of each of the equations 


F,(x) + AF.(x) = B, 
and 
x+ AF(2) = B, 
for each F, , F., F eS" and each Be HL" if and only if Ae Py. 


In Theorem 3 (and Corollary 2) the hypothesis that each of the func- 
tions f; is an onto mapping is quite necessary in order to guarantee the 
existence of a solution for each A e Py. In the following example all of 
the hypotheses of Theorem 3 except this one are satisfied: 


CO Hg: eo a — I 
e? —a,+2 = —-2. 


It is of course impossible for these equations to have a solution since, by 
adding both sides, we find that the solution would have to satisfy 


1 7a 
er +e? = —1, 


which is absurd, 


12 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1969 


Even though the functions jf; are not ‘‘onto,” it is still possible to 
specify sufficient conditions for the existence of a unique solution of 
equation (5) [and equation (1)] by strengthening the hypothesis on 
the matrix A—namely, by requiring that A « P. This is the essence 
of Corollary 3. We first require additional notation. 

With 8” defined as above, we define, for each S e 8”, the collection of 
functions 33S) mapping S into EZ” by: F e 5{(S) if and only if there 
exist, for 7 = 1, --- , nm, monotone increasing functions f; mapping 
S* onto a connected set in Z’ such that, for each ae S, F(x) = (fi(x1),°- +, 
f,(t,))’. When S = E” we denote 7(S) by 52. 


Corollary 3. If A isan n X n matrix then there exists a unique solution 
of equation (5) for each F, « S>(S), PF. ¢ 5°(S), or F, « S"(S), F, « F5(S), 
and for each Be E”, if Ae P. 


Proof: If F, 2 §"(S), Fz! : E* > S exists and F, o Fz: ¢ 5%. Thus, in 
this case, there exists a unique solution of equation (5) if there exists a 
unique solution of 


F,(Fz*(y)) + Ay = B. (6) 


Now, since A ¢ P, it follows from the fact that the determinant of a 
matrix is a continuous function of each of its elements, that there is a 
matrix A* ¢ P C P, and an e > 0, such that A.= ef + A*, Hence, 
equation (6) is equivalent to 


F(y) + A*y = B, (7) 
where we have defined . 
Fiy) = F,(Fy*(y)) + ely. 


But, since F, o FZ* ¢ Sj and el ¢ S$’, it follows that F e« s". Therefore, 
since A* ¢ Py, equation (7) and hence equation (6) and hence equation 
(5) have unique solutions. | 

The case when F, ¢ $"(S) and F, « 53S) can be reduced to the case 
just considered by making the simple observations that, in this case, 
equation (5) has a unique solution if 


AF, (x) + F.(z) = A“B 


has a unique solution, and A e P implies A~’ « P (Theorem 2). O 
In Corollary 3 a sufficient condition is given for the existence of a 
unique solution to say equation (1) when the functions f; which specify 
F are not necessarily mappings onto EZ’. That the condition (A ¢ P) is not 
“necessary is easily demonstrated by the counterexample: Let F ¢ S$ and 
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B ¢ E’; then the equations 
fi(t:) — t2 = b,, and fa(v2) + a, = by 


have a unique solution in spite of the fact that the matrix 
A= ° - ¢P. 
1 0 
This is true because the function f.(f,(v1) — 6,) is obviously a continuous 


monotone increasing function of x, , and hence the left side of the equa- 
tion 


fo(f:(a1) — b1) + ay = by (8) 


is a strictly monotone increasing mapping of Ht onto Et. Thus equa- 
tion (8) has a unique solution. 


V. BOUNDED SOLUTIONS AND RELATED PROBLEMS 


For many systems whose behavior is described by an equation having 
the form of equation (1), the vector B may be regarded as the sys- 
tem’s input and the vector x may be regarded as the system’s response, 
or output. Thus, if a sequence B’, B’, B’, --- of input vectors for the 
system is given, the corresponding sequence 2’, x’, x°, --+ of output 
vectors is specified by equation (1). An important property that such 
systems might have is that of producing a bounded sequence of output 
vectors for each bounded sequence of input vectors; that is, the property 
that whenever an input sequence B’, B’, B’, --- is contained in some 
bounded region of EZ”, then the corresponding output sequence x’, 2’, 
x’, «++ (exists and) also is contained in some bounded region of E”. 
By considering matrices A which are not members of P, , it is easy to 
demonstrate that all equations having the form of equation (1) do not 
have this property. For example, if f(x) = x + e” (f « &’), then the se- 
quence of solutions of the equation f(z) + (—1)z = b is unbounded, 
even though the sequence b = 1, 4, 4, --- of inputs is bounded. The fact 
that one must resort to matrices A which are not in Po , and the fact that 
by choosing any A ¢ P, , an example of the above kind can be constructed 
by an appropriate choice of F e« 5", follows from our next theorem. 


Theorem 4. If A ts ann X n matrix then A e Po tf and only tf for 
each F « S" and each unbounded sequence of points x’, x, x, «++ in EB, 
the corresponding sequence B’, B’, B’, --- (B* = F(x") + Az‘, k = 1, 
2, 3, -++) 7s unbounded. 
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Proof: (if) If A ¢ Po then, as shown in the ‘‘only if” part of the proof 
of Theorem 3, there exists a diagonal matrix D > 0 such that D + A is 
singular. Hence, there exists some point p e L”, p # 6, such that Dp + 
Ap = 06. Let p; , the j-th component of p, be nonzero. Let the diagonal 


elements of the matrix D be denoted by d,, --- , d, and let the mapping 
F ¢ S" be defined by 
Cee be ; for 24 j, 
d,a; + e', for: = 9. 


If p; < 0 let « = 1, if p; > O let e = —1. Consider the unbounded 
sequence x’, v”, x, --- defined by a* = k-e-p, for k = 1, 2, 3, ---. 


The members of the corresponding sequence B', B’, B’, --- are BY = 
(0, --- , 0, e*”#, 0, --- , 0)', k = 1, 2, 3, --- , where the j-th element 
of each B’ is nonzero. Since for k = 1, 2,3, --: ,kep; < 0, the sequence 


B’, B’, B’, --- is bounded. 


(only if) Our proof of the “only if” part of Theorem 4 consists of 
proving Theorem 5 which is referred to later for another purpose. O 


Theorem 5. Let F = (f,(-), «++ , fa(-))' ¢ &, A & Po, and, fori= 


1,---,n, a; S B; be given. There exist, fort = 1, --- , n, real numbers 
y; < 6; such that for any B = (b,, --- , b,)' © E” with a; S b; S B; for 
t= 1, ++: ,n, tf x satisfies equation (1) then y; S x; S 6; fori = 
| Po 


Proof of Theorem 5: We first prove a useful lemma. 


Lemma 1. Let f be a strictly monotone increasing mapping of I" onto 
itself. Let x, b, a, B be real numbers such that xf(x) S ab witha Sb SB. 
Then y S x S 6, where y = min {f ‘(a), 0} and 6 = max {f-'(8), O}. 


Proof: Let a S b S 8B and define y = min {f ‘(a), 0} and 6 = 
max {f '(6), 0}. Let x satisfy af(x) < xb. Then x(f(x) — b) S 0. Clearly, 


y <0 S é6and hence if x = Otheny S$ z S 6. If x > 0 then f(x) S 
b < B which implies x S f'(@) S 6and hencey $0 <2 S 6. If 
x < 0, then f(x) = b = a which implies x = f ‘(a) = y and hence 
yoo< 0S o> O 


(Proof of Theorem 5) Since A ¢ Py there exists k, « {1, --- , n} such 
that 2,,(Ax),, 2 0 and hence, 


©.0ig = Ce.fe,(2n,) Fb Oy (AD) 2 Cafu, u)> 


Thus, by Lemma 1, there exist y{? =y,? (fx, , a,) and 6:2? = 6,? (fe, » Ber) 
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such that y{? < a, < 6{?. Now if F,-1 denotes the mapping of E””’ 


onto E”~* defined by 


Fi = (fi(-), peta fra); Feral); et es ele 


if A,-, denotes the (n — 1) X (nm — 1) matrix obtained from A by 
deleting the k,-st row and column (note that A,_, © Po), if 


t 
Qn-1 = (Qi, 2, aaa | Qk,-1.ks ? Qiy+iika 970 y An. ks) ? 


and if 
Be poet Be a De ay eR LY 

then 
Foaila) Agee = Boa S Wee 


Since A,-, e Po, there is a k. « {1, --- , k,—1, k, +1, --- , n} such 
that x,,(A,-1”),, 2 0 and hence, as before, 


Lx, (0z, (<= Qi, k:Le,) = es 


But, if ke. kx < 0, then 


(1) (1) 
Qk, — Oke kiVky = bi, =, Oks ki Uk, = Bx, =, An, ks 9g, ? 


and if az,.., > 0, then 


(1) (1) 
Ak, Oks ks OK, s by, — Gk, ki VR, ss Bx, — Qe kiVey - 


Therefore, by Lemma 1, there is a yj? = yi2 (fess On, — GhsmsVer) 


and 6:2) = 6:2 (fis> Bes — Gis.x,5,.) such that y,? S a, S oi, if 
Qx,.x, <= 0, and similarly for a,,.., > 0. 

The above process may be repeated successively until the n pairs 
of real numbers y,)’, 6,7, (@ = 1, --- , n) have been obtained. Thus, for 
any given B with a; S b; S 6; forz = 1, --- , n, the components of 
the solution x of equation (1) will be bounded by these pairs of numbers, 
provided it is known at each step which coordinate k; to choose. The 
appropriate coordinate choice, however, will in general depend on the 
particular solution x which is associated with the given B. For different 
input vectors B the appropriate choice will in general be different. There- 
fore, in order to obtain bounds on x which are valid for all B with a; S$ 


b; SB; @ = 1, --- , nm) we must consider each of the n! permutations of 
the coordinates {1, --- , n} and, for each one, generate the set of bounds 


fy, {8:4 = 1, -+., n} fory = 1, --- , n!. We then define y,; = 


* In this equation x is understood to be (m1, +++ , 2-1, Lar, *°* » In)4 
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min {y{?:» = 1, --- , nt} and 6; = max {6{?: » = 1, --- , nt} for 
t= 1, ---,mn. Then, for each B witha; S$ b; S 8; fori = 1,---,n, 
we have that y; S 2; S 6; for? = 1, --- , n, since at least one of the 
sets of bounds {y{?, 62:7 = 1, --- , m} must always apply. O 

If the matrix A of Theorem 5 satisfies a stronger condition than 
A e Py (that is, if A satisfies a weak row-sum dominance condition), 

a2 Dlag|, for ¢=1,--+,0, 
jrt 

it is possible to use a method that requires much less computational 
effort than that of Theorem 5 to compute the vectors y and 6 whose 
components bound the corresponding components of the solution of 
equation (1). This method of computing the bounds, a straightforward 
generalization of an idea presented in Ref. 3, is explained in Appendix B. 

T'rom Theorems 3 and 4 we now have the result: Every bounded 
input sequence B’, B’, B*, --- is mapped by equation (1) into a bounded 
output sequence x, x°, x°, «+ , for each F e &", if and only if Ae Py. 

In the proof of Theorem 5, the number of real numbers y;7, 6” 
which must be computed, in order to determine bounds for zx, is 2n X 
(n!). At the expense of obtaining poorer bounds it is easy to reduce 
this number to 2n’. Suppose we compute, at the first step, the 2n 


numbers oe ee ata Wes a and set \, = min ye 3 “ott as 
py, = max {6{?, --- , 6}. Then, for each B with a; S b; S 8B; for 
t= 1, --- , 7, one of the components of the corresponding x will be 


bounded by }, (from below) and y, (from above). We next compute the 
2n numbers y{” = ¥{? (fi, e: — ps), 6? = 6 (f;, 8B: — gi”), where 
ps? = max {as;\1, Qijui 27 % 1}, gf? = min {a,;\; , @;j;41 2:9 ¥ 7}, and 
denote the smallest y‘?? by \, and the largest 6{?? by wu, . Then we have 
bounds which apply for two of the components of the x which 
corresponds to any B with a; S b; S 6; fort = 1, --- ,n. By computing 
WW = IVP i, ae — DP? — pP), b = 5(F:, Bs — a3? — gi”), ete, 
the above process may be continued to obtain the numbers \, , ---, A, ; 
Hi, *** » Ma. Hach component of the x corresponding to any B with 
a; Sb; SB; fort = 1, --- , n will be bounded by \ = min {\,,--- , Ay} 
(from below) and » = max {y,, --- , wn} (from above). 

A matter that is closely related to the proofs of the above theorems 
on the boundedness of solutions of equation (1) is that of proving: For 
each F ¢ S" and each A ¢ Py the solution x of equation (1) ts a continuous 
function of the vector B. It is obvious that it will suffice to prove that 
for each F e« S$” with F(6) = 6, and for each A ce Py, the solution x 
of equation (1) is continuous in B at B = 6. We then note that if f 
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satisfies the hypotheses of Lemma 1 and, in addition, if {(0) = 0 then, 
due to the continuity of f-*, for every « > 0 there exists ¢ > 0 such that 
if a, 8 in Lemma 1 satisfy -—'§ <a Sb S 6B < & theny, 6in Lemma 1 
satisfy —e << y S$ x S 6 < «. This observation may be used to incor- 
porate a simple ‘‘e-5 argument”’ into the steps of the previous paragraph 
to show that when F(6) = 6 then for arbitrary e > 0, one can determine 
¢ > O such that || B || < ¢ implies || x || < «. 

At this point we return to the matter of the existence and uniqueness 
of solutions of equation (1). We state first a theorem of R. 8. Palais 
(Ref. 7—see also the Appendix of Ref. 8) which shows the connection 
between the concepts of existence and uniqueness of solutions and the 
boundedness of solutions. 


Palais’ Theorem. Let f;, --- , f, be n continuously differentiable real 
valued functions of n real variables. Necessary and sufficient conditions 
that the mapping f : E” — E” defined by f(x) = (fi(x), «++ , fa(z))’ be 
a diffeomorphism of E” onto itself are: 


(2) det [of;/dx;] never vanishes. 
(12) limyete || f(x) || = ©. 


Palais’ Theorem may be used to prove a result which is almost 
equivalent to our Theorem 8, that is: 

Theorem 6. If A is ann X n matrix then there exists a unique solution 
of equation (1) for each F = (f,(t,), +++ , fa(ta))’ with continuously 
differentiable, strictly monotone increasing functions f; which map E’ 
onto itself, and whose slopes are everywhere positive, and for each B ¢ I”, 
af and only if Ae Po. 


A proof of Theorem 6 which is independent of our Theorem 3 is 
easy to construct: For all A ¢ Py , the rather trivial Theorem 1 guarantees 
that condition (2) of Palais’ Theorem is satisfied, and Theorem 5 
guarantees that condition (zz) is satisfied. If A ¢ P) then a choice of F 
such as is specified in the “‘zf”? part of the proof of Theorem 4 provides 
a case in which condition (72) of Palais’ Theorem is violated. 


VI. SUFFICIENT CONDITIONS FOR A eP, or P 


For a given matrix A, it is not in general an easy task to determine 
whether or not A satisfies any one of the four equivalent conditions 
of Fiedler and Pt&k which are given in Section III and which serve to 
define the class of matrices Py (or the conditions which define P). This 
is particularly true when the order of A is large. For this reason, we 
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now give several conditions which are sufficient to insure that a matrix 
A isin P, or P (and which are not so difficult to verify). 

Suppose it were known that every eigenvalue of A as well as every 
eigenvalue of each principal submatrix of A had a nonnegative (positive) 
real part. Then this would guarantee that A « Py (P). This is the main 
idea involved in the following theorem. 


Theorem 7. If any one of the following inequalities is satisfied by the 
elements a;; of the matrix A, foralli = 1, ---,n,thenAeP,. 


(1) an =O, | a; os age OS a 1 
pi kt 
(it) Gis = a’ | ai; ie es P = i FS p + q* = 1, 
j#i 


a; positive numbers satisfying >> (1 + a;)"' < 1; 
i=l 


(ivi) a;; 2 a max |a;; |, a positive satisfying 
prt 


n 


> (2 | as | (max | as; )"} = allay (0/0 = 0). 
If any one of the above inequalities with = replaced by > ts satisfied for 
t=1,---,n,thenAecP. 


Proof: If the right-hand side of any of the above inequalities is 
denoted by the nonnegative number 7; then it is well known that all 
of the eigenvalues of the matrix A are contained in the union U{C; : 
7=1,---,n} of the disks C; = {z:|z—a,; | S 7;}.° But the condition 
a;; 2 (>) 7; guarantees that if z « C; then Re(z) = (>) 0. Thus, 
each of the eigenvalues of the matrix A has a nonnegative (positive) 
real part. The same is true of each eigenvalue of every principal sub- 
matrix of A, for if one of the above inequalities is satisfied by the 
elements of A it is also satisfied by the elements of any principal sub- 
matrix. O 


VII. COMPUTATION OF THE SOLUTION 


At present, the authors know of no single computational algorithm 
which is guaranteed to yield the solution of equation (1) for all F ¢ $’, 
AeP,,Be E”. However, there are several ways that the solution may 
be computed for large classes of such equations. 

If, for example, the matrix A satisfies either a weak row-sum or weak 
column-sum dominance condition (inequality (7) of Theorem 7 with 
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either a = 1 ora = 0) andif Ff ¢ §” with, roughly speaking, the slopes 
of each f; bounded from below by some positive constant, then it can 
be shown (see Appendix A) that an algorithm for computing the solu- 
tion can be obtained by the use of Banach’s contraction-mapping 
fixed point theorem. 

If the matrix A is positive semidefinite then, as mentioned in Sec- 
tion I, the existence of a unique solution of equation (1) for all F « s” 
follows from the earlier work of Sandberg and Minty. If, in addition, 


there exists for 7 = 1, --- , n, positive constants a; and 8; such that 
u—v 


for all u ¥ v, then Sandberg’s iteration scheme (also resulting from 
an application of the contraction-mapping fixed point theorem) can 
be used to compute the solution.’ In this regard, if the techniques 
of Section V are first used to obtain bounds on the location of the 
solution then one could modify equation (1) by changing the nature 
of the functions f; outside the domain in which the solution is known 
to lie (but still keeping the f; strictly monotone increasing from E’ 
onto EZ") and obtain a new equation which has the same solution as 
the original equation. By doing this, the functions f; in the new equa- 
tion might be made to satisfy the above inequalities in cases where 
this was impossible for the original f;. Also, even if these inequalities 
could be satisfied for the original equation, larger values of a, and 
smaller values of 8; might be used for the modified equation. This can 
result in a more rapidly converging iteration process (see Section VII 
of Ref. 3). Similarly, the bounds can be used to improve the perform- 
ance of other iteration schemes. 

In case A ¢ Py is not positive semidefinite, it might be that there exist 
diagonal matrices A, , A, > 0 such that A,AA, is positive semidefinite. 
If such matrices can be found, then Sandberg’s iteration scheme could 
be used to compute the solution of the equation 


A, F(A. x) + A, A Aor = A,B, 


from which the solution of equation (1) may be obtained directly. 
Unfortunately, it is not the case that such A,, A, > O exist for all 
A « P,. For example, it is quite easily verified that for 


a=! | 
1 0 
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even though A « P,, the matrix A,AA, is not positive semidefinite for 
any choice of A, , A, > 0. 
It is easily verified, however, that appropriate A, , A, > O can be 
found for all 2 X 2 matrices A ¢ Py except those for which 
(2) Q11QA20 = 0, 


and 


(72) Ay2Q2, 0, 


and 
(22t) either a). ¥ 0 or a, ¥ 0. 


In particular, for all nonsingular 2 X 2 matrices A © Po, appropriate 
A, , Az (Az = I) can be found. Thus, Sandberg’s iteration scheme could 
be used, for example, to compute the solution of the example problem 
of Section II which was taken from Calahan’s book. 


VIII. APPLICATION TO EQUATIONS FOR TRANSISTOR NETWORKS 


In this section some of the above theory is applied to the equations 
which describe the behavior of electrical networks containing tran- 
sistors. By the word transistor we refer to the three-terminal device 
whose equivalent circuit is shown in Fig. 2.* Considering the tran- 
sistor as a nonlinear two-port network, the following equations which 
express the port currents in terms of the port voltages follow imme- 
diately from inspection of Fig. 2: 


-| 1 a lia: 

—Qo1 1 fe(v2) 

We assume, as is the case for the usual large-signal model of a physical 
transistor, that 0 < aj. < 1,0 < a. < 1, and that both of the func- 
tions f, and f, are continuous and strictly monotone increasing. The 
character of the functions f, and f, which describe the transistor’s 
nonlinear conductances will depend on whether the transistor is des- 
ignated as NPN or PNP. We shall, however, have no occasion to dis- 


tinguish between these two cases in what is to follow. 
Suppose an electrical network is synthesized by connecting together, 


a 








te 


*TIn some respects this equivalent circuit is an ideal model of a transistor. 
Nevertheless, since this model is often used in the design and the computer 
analysis of transistor networks, consideration of it is important. The presence of 
series resistance at the base, emitter, and collector terminals of a transistor will 
be considered by the authors in another paper. 
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i2| Q COLLECTOR 





it | O EMITTER 


Fig. 2— The equivalent circuit of a transistor. 


in an arbitrary manner, any (finite) number of transistors, resistors 
(that is, linear resistors with nonnegative resistance) , voltage sources, 
current sources, and nonlinear resistors which are described by strictly 
monotone increasing conductance functions (and which we shall hence- 
forth refer to as “diodes”). Suppose the network contains 7 transistors 
and d diodes. For k = 1, -::, n, let ro,_1, Yen, Your—1, and Yo, denote 
the voltage and current variables v1, vz, 1, and %, respectively, for 
the k-th transistor. For k = 1, ++, d, let ton43, and Yon+x% denote the 
voltage across, and the current through, the kth diode. Let these 
variables be related by Yensn = fon+n(Lentn). Then, if = (1%, ++, 
Tan+a) *and i ie (yi, a) Yon+a) *, we have 


y = TF), (9) 


where T = diag(7,, T2), with T; a block diagonal matrix with n 2 x 2 
diagonal blocks of the form 


1 2A0 | 
? 
—a;; 1 | 


and T2 ad X d identity matrix. The nonlinear function F has the 
form F(a) = (fx (x), Cogs fonta(Von4-a) )?. 

Consider now the (2n+d)-port network of resistors and independ- 
ent sources which is formed from the original network by removing the 
transistors and diodes. If the y-parameter matrix G of this (2n+d)- 
port exists then we have the additional equation relating the vectors 
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xand y: 


where wu is some vector of constants which is, in general, nonzero since 
sources are present in the (2n+d) -port. 
Combining equations (9) and (10) we obtain 


TF(x) + Gx = u. (11) 


Now T is a nonsingular matrix and hence, if equation (11) is multiplied 
by 7’, we obtain an equation having the form of equation (1). If 
the matrix 7” ’G e« P, then, by Corollary 1, there exists at most one 
set of transistor and diode voltages satisfying equation (11). Moreover, 
if each of the nonlinear functions describing the transistors and diodes 
in our network maps EZ’ onto E’, or if T~'G e« P, then Theorem 3, or 
Corollary 3, guarantees the existence of a unique solution of equation 
(11). 

We have been careful to distinguish between the case when our 
theory guarantees only the uniqueness of a solution and the case 
when it guarantees both the solution’s existence and its uniqueness 
for the following reason: In the analysis of transistor networks the 
nonlinear functions which are used to describe diodes or to describe 
the nonlinear conductances in the equivalent circuit of a transistor 
are often taken to be of the form 


f(x) = In(e* — 1), 


where J, and ) are constants. The range of such a function is not the 
entire real line. Presumably, therefore, one can construct transistor 
networks having the property that if functions of the above type are used 
in a transistor’s equivalent circuit then the network admits no solution. 
We now give a simple example of such a network. We wish to emphasize, 
though, that even for these networks whose equations may sometimes 
have no solution, our theory still guarantees that if 7~’G « P, and if a 
solution of equation (11) exists, then it is unique. 

Consider the network of Fig. 3. For this network, equation (11) 


becomes 
“ ) 
I, 


oe. ce a 

jee 1 fo(v2) ~GJ Gg Ave 
Suppose aye = 0.5, ae: = 0.9, and g = 5.5 mhos. Then, the above equa- 
tion is equivalent to 
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fix) 5 —5} Io, 1/20 10 
ae ar 
f2(v2) _ 4 1 1 Vo 18 20 
Hence, v, and v, must satisfy 


fiQ:) + 5fe(ve) = 10. + J). 


If we now assume that the transistor’s nonlinear conductances are 
described by the functions 


fie) = -—Lé™" — 1), 
fo(ve) = —I(e"" =e). 


where the parameters J,, J., \,, and i, are each positive, then for 
all v, , v2 we have 


he 
I, 











fi(v,) + 5fo(v) <I, + 5I,. 


Hence, if the values of the independent current sources of Fig. 3 are 
chosen such that 


I+, >of. + 4/., 


then the equation for this network has no solution. 

Let us now consider the problem of determining whether or not, for 
a given network, the matrices 7 and G in equation (11) satisfy the 
condition 7~’G « Py (or T-’G ¢« P). (The existence of many transistor 
bistable circuits assures us that this condition is not always satisfied.) 





Fig. 3 — A transistor network whose equations may have no solution. 
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There is a large class of networks for which this condition is satisfied, 
and for which a simple inspection of the G matrix suffices to identify 
a member of the class. 

Since the matrix 7 satisfies a strong column-sum dominance condi- 
tion, that is, since 

es | Bel for t=1,--- ,2n+d, 
Peak! 
the following theorem guarantees that if the matrix G also satisfies a 
strong column-sum dominance condition, then 7~’G e P, and that if 
the matrix G satisfies a weak column-sum dominance condition, 
gi Z Ss | gi |, 
irét 

then JT 'G e Py and, hence, the above conclusions concerning the ex- 
istence and the uniqueness of a solution follow. 


Theorem 8. If the square matrix A satisfies a strong column-sum dom- 
tnance condition and tf the square matrix B satisfies a weak (strong) column- 
sum dominance condition, then A~*B ¢ Py (P). 


Proof: Suppose A~*B ¢ P, . Then, by the main result of the ‘only if” 
part of the proof of Theorem 8, there exists some diagonal matrix 
D > Osuch that det (D + A~*B) = 0. But det (D + AB) = det (A7’). 
det (AD + B), and det (A~’) # 0. Likewise, det (AD + B) #0 
since AD + B satisfies a strong column-sum dominance condition. 
Hence, A“Be Py. 

With B strongly column-sum dominant, let 6 > 0 be such that 
B — 6A also possesses the strong dominance property. Suppose that 
A7'B — 6I ¢ P,. Then, as above, there is a D > 0 such that A™’B — 
61 + D = A™[B — 6A + AD) is singular, which is a contradiction. 
Therefore A~*B — 6I « Py, and, by Theorem 1, A’Be P. O 


IX. COMMON~BASE TRANSISTOR NETWORKS 


We now consider a special class of the networks which are comprised 
of transistors, resistors, diodes, and independent sources. We consider 
the class of all such networks for which there is a single node (called 
ground) to which the base terminal of each transistor is connected. 
Let us first consider a subclass of this class of networks; that is, let us 
temporarily assume that no diodes are present. For all networks in 
this subclass it is easily verified that when the G matrix for equation 
(11) exists, then it satisfies the above weak column-sum dominance 
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condition and hence, by Theorem 8, 7-’G e« Py. This fact is made 
evident if we consider the network of resistors which is described by 
G (that is, the linear multiport to which the transistors are connected, 
with all sources removed) and first simplify this network by using the 
star-mesh transformation to remove all internal nodes. Of course for 
many networks of this subleass G is strongly column-sum dominant, 
in which case T’G ¢ P. 

It is clear that the networks for which the G matrix fails to exist 
are exactly those networks in which either one or more of the collec- 
tor or emitter terminals are connected, through the resistor network, 
directly to ground (that is, through a branch having infinite con- 
ductance), or else two (or more) of the transistors’ collector or emitter 
terminals are connected directly together (through a branch of the 
resistor network having infinite conductance). These direct connec- 
tions can exist in the resistor network either because of corresponding 
short-circuits in the original linear multiport, or because of corre- 
sponding connections involving branches which contain only ideal 
voltage sources. 

If one assumes that each transistor in the network has a nonzero 
series resistance associated with both its emitter and its collector 
terminals (this assumption certainly being consistent with physical 
reality) then one need not be concerned about the possibility of the 
nonexistence of the G matrix since the situations mentioned in the 
previous paragraph cannot occur. We now show, however, that one 
need not rely upon this assumption in order to prove the uniqueness 
of the solution of the equations which describe the networks that we 
are considering. 

We have observed that the matrix G will not exist if and only if 
the linear multiport has fewer independent port voltages than it has 
ports. In this case we modify the nonlinear multiport in such a man- 
ner that we can break some of the connections to the linear multiport 
so that it then possesses a G matrix and hence can be described by 
an equation having the form of equation (10). The modifications 
to the nonlinear multiport which are called for are obviously the ad- 
dition of voltage sources between certain nodes, the values of these 
sources being the same as those of the voltage sources connecting the 
corresponding nodes in the linear multiport. This simple concept is 
illustrated in Fig. 4. Here, the network of Fig. 4a, containing a linear 
6-port, has been replaced by the “equivalent” network of Fig. 4b con- 
taining a linear 3-port. Although the G matrix of the 6-port does not 
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Fig. 4 — Example of a grounded-base transistor network. 


exist, it does exist for the 3-port which can be described by 


‘, to b- Ose: a 
Geet, Be SOillae! ae |) a. 
is 0 O Ll, 0 


We have shown that the above artifice allows an equation having the 
form of equation (10) to always be written to describe the linear 
multiport contained in our network. We now show that an equation 
like equation (9) can be written to describe the nonlinear part of our 
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modified network. The equation which we obtain is of the form y = 
PTF (Ptx + C) with P an m X 2n matrix (m < 2n) and C a 2n-vector. 

Consider the equation which describes the nonlinear part of a com- 
mon-base transistor network before any of the above-mentioned 
modifications (that is, the addition of voltage sources) are made. 
This equation has the form of equation (9) with 7 being a 2n x 2n 
block diagonal matrix (recall that n is the number of transistors 
present). Let us consider the effect on this equation of the modifica- 
tion of the network by adding voltage sources, one at a time. There 
are two different ways of adding voltage sources that must be con- 
sidered. 

Suppose a voltage source of voltage EH is connected between nodes 
j and k (with plus reference at node 7), and suppose the connections 
between node 7 and the linear multiport are then open-circuited. This 
situation is illustrated in Fig. 5. Using the notation indicated in this 
figure, we have 


°, 
~ 
I] 


TF), 


=U for v # j,k, 


1; = 0, 
%=%~+u, 
v, =X, + E. 


Let us now define the vectors v* and 7* to be the (2n—1)-vectors 
obtained from v and 72, respectively, by deleting the v; and 2; elements. 
Then, if F*(v*) is the 2n-vector obtained from F(v) by replacing the 


i" ae ( i 





Fig. 5 — Typical modification of the nonlinear multiport network. 
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argument v; by v, + E, we then have that 
CS Tw), (12) 


where the (2n—1) X 2n matrix 7* is obtained from TJ by adding the 
j-th row to the k-th row and then deleting the j-th row. Observe that 
T*F*(v*) can be written as QTF(Q‘v* + R) in which the j-th element 
of the 2n-vector R is E, all other elements of R are zero, and Q is obtained 
from the identity matrix of order 2n by adding the j-th row to the k-th 
row and then deleting the j-th row. 

In case a voltage source of voltage H is connected between node j 
and ground (with the plus reference at node j) and all connections 
between node j and the linear multiport are open-circuited, then we 
can again form equation (12) from equation (9) by simply replacing 
v; by E wherever it appears in the argument of F, to form F*(v*), 
and deleting the j-th row of the matrix 7, to form J*. In this case 
T*F*(v*) can be written as QTF(Q‘v* + R) in which R is as defined 
earlier, but in this case Q is obtained from the identity matrix of order 
2n by simply deleting the j-th row. 

The above processes can be applied repeatedly to account for the 
addition of an arbitrary number of voltage sources to the nonlinear 
multiport. The resulting equation which describes the multiport will 
have the form 


y = Q, +++ Q.Q:TFQ:Q ++: Q2 + C) 
= TF (2) 
with C some constant 2n-vector and each of the Q; obtained from the 
identity matrix of the appropriate order in one of the two ways described 
above. 
Consider equation (9) in which T is a square matrix. Due to the strict 
monotonicity of each component function of F, the mapping TF (zx) 


has the following property: If p, q are arbitrary 2n-vectors then there 
is a diagonal matrix D > 0 such that 


TF(p) — TF@) = TD(p — q); (13) 


and furthermore, the matrix TD is strongly column-sum dominant 
(since 7 is strongly column-sum dominant). We now wish to show that 
a similar fact is true in the more general case. 

With m the number of rows of Q,, let p and gq denote arbitrary 
m-vectors. Then since there is a diagonal D > 0 such that 


F(Q:Q2 +--+ Qp + C) — FQiQ: --- Q9 + C) = DQiQ2 --- (p — 9); 
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we have 
TF (p) rx TF(q) = Q, °°: Q:Q:T DQ:Q: «+: Q:(p — 9). 

The fact that Q, --- Q:0,7DQiQ; --- Qj) is strongly column-sum 
dominant follows from the very easily verified proposition that the 
product Q,MQ; (k = 1, 2, --- , p) possesses that property whenever 
M does. 

Therefore if x’ and x* denote two solutions of the ‘‘generalized equa- 
tion (11),” then [TD + G](z’ — 2’) = 6 in which 


T=Q,:::Q@.0,7T and D= DQiQ,---Q. 


But TD, and hence TD + G, is strongly column-sum dominant and 
hence, nonsingular. This implies that x* = 2”. 

We have now shown that in any network constructed from resistors, 
independent sources, and transistors having a common-base connection, 
the transistors’ base-emitter and base-collector voltages are unique. It 
is a trivial matter to show that the same result applies when diodes 
are also allowed to be present in the network. 

‘Suppose the result was not true for some network containing at least 
one diode. Then there would be two different sets of voltages and cur- 
rents which satisfy Kirchoff’s laws. Thus for each diode in the network 
there would be two (not necessarily distinct) pairs of points (vf? , 7%), 
(vf , 7) at which the diode is biased, corresponding to each solution. 
Letting f denote the strictly monotone increasing function which 
characterizes the diode we have 7$? = f(v§?) and 7 = fio). But 
then, suppose the diode is replaced by the series combination of a 
resistor 7 and a voltage source EH whose values are chosen so that the 
line 7; = (1/r)v; — E/r passes through the points (v{” , 7§) and (v$” , 
if). (Due to the strict monotonicity of f, this can certainly be done 
with some positive choice of r.) Performing the above type of sub- 
stitution for each diode in the network, we obtain a new network of the 
type already considered. This new network would possess two different 
sets of transistor base-emitter and base-collector voltages (the same 
as before). This contradicts our previous result, and hence the previous 
result must apply, even when diodes are present in the network. 

To determine the equilibrium solutions of the differential equations 
which describe a network containing inductors and capacitors as well 
as the elements mentioned above, one must determine the solutions of 
a de equation for a network of the above class. Thus, in summary, 
what we have shown is: One cannot synthesize a bistable network 
which consists of resistors, inductors, capacitors, diodes, independent 
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voltage and current sources, and an arbitrary number of (Fig. 2) 
transistors having a common base connection (or, in particular, only 
one transistor). 
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APPENDIX A 


Algorithms for Computing Solutions of Equation (1) 


In this appendix two algorithms for computing the solution of 
equation (1) are presented. It is proved that one of the algorithms will 
always converge to the solution of equation (1) if the matrix A satisfies 
either a weak row-sum or column-sum dominance condition (inequality 
(t) of Theorem 7 with either a = 1 ora = 0) andif, roughly speaking, 
the slopes of each f; are bounded from below by some positive constant. 
In each case the proof of convergence relies upon Banach’s contraction- 
mapping fixed point theorem, and therefore also represents an inde- 
pendent proof of the existence and uniqueness of a solution of equation 
(1) for the conditions stated above. 

The following notation will be used: For fixed F « 5", B e« EH”, let 
f(x) = F(x) — B; also, if A isa givenn X n matrix with elements a,; , 
we define the diagonal matrix D by D = diag [a,, , doz, --- , Gaal, and 
Jet A= A — D. 


Theorem A. If then XK n matrix A satisfies 


a: 2 PRE rae for Gs Loe gh; 
THe 


and if F « 5", Be E", and tf there exists some « > 0 such that for each 
a, Bek',el|a—B| S| fila) — f:(8) | fort = 1, --- , n, then equation 
(1) possesses a unique solution, and if x° is an arbitrary point in E”, the 
sequence x, x, x, --+ defined by 


a* = (f + D)'(—A)a" 
converges to the solution. 
Proof: Equation (1) may be rewritten as 
f(z) + Dx + Ax = 6. 
Hence, if the operation T: E” — E" is defined by T = (f + D)7'(—A), 
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then the solution of the equation x = Tx is identical to the solution 
of equation (1). We now prove that the sequence x”, x’, x”, -- +» converges 
to this solution by proving that 7’ is a contraction. 

Let « and y be arbitrary points in £” and let g = Tz,h = By. Then, 
f(g) + Dg = —Az and f(h) + Dh = —Ay. Thus, for 7 = 1, nN, 


fig.) — 5: + ang, = —(Az),, 
and 
fi(hi) — 6; + ash; = —(Ay);. 
Subtracting, we obtain 
figs) — 1(hi) + aislgs — hi) = (Ay)i — (An), . 
Since f; is strictly monotone increasing, we have 
[fs(92) — fis) | + ase l ge — Ae | = | (Ax), — (Ay): |, 


and hence, since e + a,; > 0, 


La. — he | S| (a), — (Ay): 


Now, 
| (Az); — (Ay); | = | » ais(x; — Y;) | 
Ss 2 (1 a ap acl) 
Ss (2 | as; |) max yee ae 
Thus, defining the metric p on E” by p(z, y) = max |x; — y; |, we 
have, for? = 1, --- , n, , 





| gi — h; | ij |) p(z, y). 


IIA 


But, since 0 S$ > |a:;| < a:; + «, there exists K,0 < K < 1, such 


that | 9; Ki ‘p(x, y) fori = 1, --+- , n, and in particular, p(Tz, 
Ty) = max ae —h;| S K-p(z, iy: Gane T is a contraction. O 





Theorem B. If then X n matrix A satisfies 
G2 Dae, for t= 1, +++ ,n, 


Trt 


and if F « S", B « E”, and tf there exists some « > 0 such that for each 
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a,BeH',el|a—6| S| f(a) — f.(B) | fort = 1, --- , n, then equation 
(1) possesses a unique solution, and if z° is an arbitrary point in E”, the 
sequence z°, z’, 2", -++ defined by 

= —A(f + Dye 


k+1 
@ 


converges to some point z* and the solution of equation (1) is given by 
a* = (f + D)te*. 


Proof: As in Theorem A, the solution of equation (1) is also the solu- 
tion of e = (f + D)7'(—A)z. For each x « BE", let z = (f + D)x and 
hence « = (f + D)~*z. Thus, x* is the solution of equation (1) if 2* = 
(f + D)~*z*, where z2* is the solution of 2 = —A(f + D)~*z. The theorem 
is thus proved if it is proved that the operator T = —A(f + DD) isa 
contraction. 

Let P denote the operator (f + D)~*, and let x and y be arbitrary 
points in EZ”. Then, proceeding as in the proof of Theorem A, we obtain 


1 
* — ° < ne ee ee . a. . — eee 
\(Pa)i— (Pdi | So-pzle— wl for Gd yn. 
Thus, if g = Tx andh = Ty, then forz = 1, --: ,n, 


GS e a;;(Px); and h,=— oe a;(Py); . 
imi 


iwi 


Hence 
gi — h; | = | yz a,;((P2); = (Py);) | 
= > (| Qi; || (Px); — (Py); l) 
= (1 a3; Perera — U; )- 
Therefore, 


Slam ls DD tel |e, - wy 


= > (ydeel) ie, w. 


jai Ving G3; + € 


But, there exists K,0 S K < 1, such that, forj = 1, --- ,n, 


y ltl <x, 


imi Oj; “Pe 
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and hence 
n n 
Dlg — hk | SK Delay — yl. 
‘= 17= 


Defining the metric p on E” by 





pe, y) = De | as — uy 


? 


we therefore have 


a(x, Ty) = > lg: — hs | S K-p(z, y), 
and hence 7 is a contraction. O 
APPENDIX B 


Determination of Bounds on the Solution of Equation (1) 


In this appendix we present a method for determining bounds on 
the solution of equation (1) when F' eS", A is weakly row-sum dominant, 
and (for given a = (a,, °+:,a,)',8 = (6:, °°: ,B,)' eB") B= (b,, 
-++ ,b,)' satisfies a; S b; S 6; fori = 1, --- , n. The solution bounds 
are, in general, easier to compute than those of Theorem 5. The method 
presented here is a generalization of an idea presented in Ref. 3. 

The computation of the solution bounds proceeds in two steps. First, 


one solves each of the equations 


F(z) =a (14a) 
and. 
F(x) = B. (14b) 
Denoting the solutions of equations (14a) and (14b) by uw = (u,, 
“++ wa) and vy = (y,, +++, »,)’, respectively, and defining 
N= max{| mi], +++ [aol lal sss | eal}, 


and 
B= (> | Qi; lj Peg 3 | Qn I, 
il jn 
one then solves each of the equations 


F(a) + diag [ay ,--°+ 5 Gan] = a — dB’, (15a) 
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F(@) + diag [au , +++; Gunle = B + 0B’. ~ (15b) 


Denoting the solutions of equations (15a) and (15b) by y = (y,, 

-,Yn)' and 6 = (6,, --- , 6,)‘, respectively, one has y; S x? S 4; for 
i= 1, ---,n, where x’ is the solution of equation (1) that corresponds 
to any B satisfying a; S$ b; S$ B; for? = 1,---,n. 

To prove that the components of the vectors y and 6, determined by 
the above procedure, are indeed bounds for the corresponding compo- 
nents of the solution x° involves no more than a word-for-word repetition 
of the proof of Theorem 2 of Ref. 3, with several quite obvious modifica- 
tions. We omit the details. 
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Some Theorems on the Dynamic Response 
of Nonlinear Transistor Networks 


By I. W. SANDBERG 
(Manuscript received July, 16, 1968) 


Felative to the huge body of theory of linear time-invariant systems, very 
little of a general and precise nature ts known about the network-theoretic 
properties of transistor circuits operating under large-signal conditions. 
One basic property P which a transitor network might have is that tf the 
input approaches a constant, then the output approaches a constant which is 
independent of the initial conditions. In this paper we prove a stability 
theorem concerning a nonlinear differential equation that governs the 
behavior of a large class of networks. A corollary of this theorem asserts 
that tf a certain condition is satisfied, then property P holds. 

We consider also the problem of estimating the rate of decay of transients 
in transistor networks and we prove theorems which allow us to make some 
often quite conservative, but definite, statements concerning limitations on 
switching speeds. A practical example considered shows that in some cases 
the bounds, which are frequently very easy to evaluate, can be quite useful. 

The proofs depend in an interesting way on the relationship between 
the static diode characteristic and the nonlinear capacitance associated with 
a semiconductor junction. 


I. INTRODUCTION AND DERIVATION OF THE DIFFERENTIAL EQUATION 


We initially consider the network of Fig. 1, which contains transis- 
tors, linear resistors, voltage sources, and current sources. Each 
transistor is represented by a model of the type shown in Fig. 2 (see 
Gummel? and Koehler?) which takes into account nonlinear de proper- 
ties as well as the presence of nonlinear junction capacitances. Asso- 
ciated with this model are six parameters: ay, a-, Te, Te) Ce, ANA Cy (all 
positive constants; ay < 1, a, < 1) and two nonlinear functions f,(-) 
and Tol?) 

Concerning f,(-) and f.(-), for our purposes it is necessary to as- 
sume only that a ayy eng 
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Fig. 1— General network containing transistors, sources, and resistors. 


Assumption 1: For each transistor: f.(-) and f,(-) are strictly-mono- 
tone increasing mappings of the real interval (— 0, o) into itself; 
f-(0) = f,(0) = 0, and f-(-) and f,(-) are continuously differentiable 
on (—o, 0). 
The functions f,(-) and f,(-) of Gummel’s model' are of simple 
exponential type and satisfy Assumption 1. 

From Fig. 2: 


i = Flee + rel] + flv) — afeloo) 


ic = Z toa, + rhlv)l — ail.) + fel. 


Suppose that the network of Fig. 1 contains p transistors; for k = 
1,2, +++, p, let ve,_1 and vo, respectively, denote the emitter to base 
voltage and the collector to base voltage of the kth transistor. Simi- 





Fig. 2— Transistor model. 
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larly, for k = 1, 2, --:, p, let 22,1 and %;, respectively, denote the 
emitter current and the collector current of the kth transistor (with 
reference polarities as indicated in Fig. 2). Then, with v = (v1, Va, 

- ) Vop) t= (t, 12, °** 5 tap), fox1(+) and Cox the f,(-) and c, of 
the kth transistor, and fo,(-+) and Co, the f,(-) and c, of the kth tran- 
sistor, 


i= £(c@)] + TFC) (1) 

Where: for? Ss eS Dp. 
[CON], = 60; + rif) (2) 
FO; = fies), (3) 


and T= 7,@T.@---@T,, the direct sum of p 2 X 2 matrices 


T, in which 
1 —at” 
—at" 1 | 
fork = 1,2, +--+, p. 


We assume that the linear resistive portion of the structure of Fig. 1 
introduces the constraint 


‘a 





t= —-Gv+B (4) 


in which G is a conductance matrix and B is an element of the set ® 
of all real bounded continuous 2p-vector-valued functions of ¢ on [0, ~). 
From equations (1) and (4) 


© (C@)] + PFO) + Go = B. (5) 
Let wu = C(v). Since all of the c; and 7; are positive, and each of the 
f;(-) is continuous and monotone increasing, there exists a C”'(-) such 
that v = C7*(u). Thus, 
du -1 -1 
at PPIC WI + GC @) = B. (6) 


‘The Jacobian matrix J, of TF[C™*(u)] + GC7*(u) is 


. filgi(u;)] : 1 
Pou te + aw) as tr = aan 
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in which for all 7 = 1, 2, --- , 2p 
gi(u;) = [C*(u)]; 
with each of the g;(-) continuously differentiable. 
Since J, is continuously dependent on u, and || J. || (|| - |] any norm) 


is bounded from above uniformly in u, it follows that there exists a 
constant DZ such that 


|| TF[C™*(w.)] + GC~*(u,) — TF[C™(w)] — GC™*(w,) || 
<L\lu.— w|| (7) 


for all u, and uw, belonging to real Euclidean 2p-space H’’. In particular, 
we have 
|| TF(C(u)] + GC“(u) — B\| S$ Lil ull] + |B Il (8) 


for allt = 0 and all we LE”. Therefore (see, for example, Nemytskii and 
Stepanov’), for any initial condition uo e E””, there exists a unique con- 
tinuous 2p-vector-valued function u(-) such that u(0) = uw) and equation 
(6) is satisfied for all t > 0. In other words, under the assumptions we 
have introduced, it makes sense to study the properties of the solution 
of the equation 


ML PrC w+ Gie"@) =B, 120 uO)=u) ©) 


Ii. STATEMENT OF RESULTS, AND EXAMPLES 


We need the following definitions. 

Definition 1: A real matrix M of arbitrary order n is strongly col- 
umn-sum dominant if and only if for allj = 1,2,...,n 

Miz — d, | m; | > 0. 
tq 

An important property of T' is that it is strongly column-sum dominant. 

Definition 2: We shall say that a real matrix M of order 2: is an element 
of D if and only if there exists a diagonal matrix diag (d, , d,, +--+ , doy) 
with each d; > 0 such that 


doy 1 
ay” << hace < ot 
fork = 1, 2,--- , p, and diag (d, , d., --+ , ds») M is strongly column- 


sum dominant. 
Our main result* concerning equation (9) is: 


* Proofs of all results in this section are given in Section III. 
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Theorem 1: If Ge D, and u,(-) and us(-) satisfy 
ot + TF[C'(u.)] + G[C™(u.)] = B.(2), i=0 (10) 


du 4+ TFIC(u)] + GIC@)] = Bl), 120 (11) 


with B, e ® and B, « B, and if [B,() — B,(t)] — 4 (the zero vector of E*”) 
ast—> ©, then [u,(t) — w(t)] > dast— ~. 
An interesting corollary of Theorem 1 is 


Corollary 1: Referring to equation (9), if Ge D, and if there exists 
a constant vector B,, such that [B(t) — B,] — @ as t — ©, then there 
exists a constant vector u,, such that [u(t) — u,,] ~ @ast— o, and u,, is 
independent of the initial condition u, . In particular, if B. = 0, then 
Un = 0. 


It is interesting to observe that G « D whenever the base leads of all 
transistors are connected together and there is a resistor between the 
emitter and base, and between the collector and base, of every transistor, 
for then G is strongly column-sum dominant. Also it is easy to give 
examples of conductance matrices which are not strongly column-sum 
dominant, and which belong to 9. For instance, for the network of 
Fig. 3. 





Fig. 3 —Single-transistor network. 


G= 





Ja + gr ‘ 
— 9 Jo 


and diag (d, , d.)G is strongly column-sum dominant for d, = 1 and 
some d, such that 
1 * 
as < d, < 
a 


r 


* More generally, G of order 2p with positive diagonal elements belongs to D 
whenever it is possible to obtain a strongly column-sum dominant matrix from 
G by adding an arbitrarily small positive quantity to a single diagonal element. 
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Fig. 4— A two-transistor circuit. 


As another example, consider the circuit of Fig. 4, for which 
473 —10 10 —ll1 
1/-10 473 -11 10], 
10 —Il1 11 -—10 
—1l1 10 —10 11 


Since diag (1, 1, 22, 22)G is strongly column-sum dominant, G ¢ . 
Finally, for the network shown in Fig. 5, 


i140. 10° 211 
iS. ate “Aol 
AP) 496, .<2ak. SH a0 

St 10-10. 41 


In this case, G is obviously singular and hence does not belong to 9. 
Suppose that the source current of Fig. 5 7)(¢) is a constant and that the 
transistor functions f:(-), fe(-), fs(-), and f,(-) are all bounded from 
below by the constant 6 (this is certainly an assumption consistent with 
our earlier assumptions and with the character of transistor models 


G= 


— 


in 


iottf 


Fig. §— Transistor circuit for which the de equations may have no solution. 
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ordinarily used.) We wish to show that here for sufficiently small 7 , 
there does not exist a constant vector wu, such that [u(é) — u.] — 6 
ast— oo, 

Suppose that u(t) > u,, a constant vector, as £ — o. Then there 
would exist a 2p-vector v,, such that u, = C(v,.) and 


TF(v.) + Gv. = B 


with B = (zy , 0, 0,-- -, 0)*". Let 7 denote the 2p-row-vector (1,1, 1,---, 1). 
Then 
TF (v.) + 7G. = nB. 


But 7Gv,, = 0, and hence 
io = DL = a YrilCantnn) + DU — a! Yaron) 
which does not possess a solution »v,, if 
ip <b pa ice a Se, 


2.1 Estimation of the Rate of Decay of Transients 


Theorem 2: If the hypotheses of Corollary 1 are satisfied with B(t) = B., 
fort = 0, then 

2p 2p 

Dd; | u,() — wo; | S exp(—Ki) Did; |usO) — us; |, 120 

i=l 7=1 


for every set of positive constants d, , dz, +++ , de, such that 


0<K = minmin ( (1 — djd;"a;), * (935 — Do did;* | 933 p} 

i i i ii 
in which —a; is the nonzero off-diagonal term in the jth column of T, 
and d; = djs for j odd and d; = d;-_, for j even. 


It is easy to show that G e« D implies that there are positive constants 
d;,j = 1,2, --- , 2p, such that K > 0. 

As an example of the application of Theorem 2, consider the problem 
of estimating the switching time of the single-transistor inverter circuit 
of Fig. 6 in which a, = 0.968, c, = 2 X 10°” fd, 7, = 1.7 X 10°” second, 
a, = 0.583, c, = 1.7 X 10°” fd, and r, = 2.62 X 10°* second. Here 
(in mhos) 

G= 1.1886 X 10°° —1.01215 x 10° 


—1.01215 X 10° 1.01215 x 10° 
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Fig. 6— Practical logical-inverter circuit. 


which takes into account a bulk base resistance of 280 ohms and a bulk 
collector resistance of 18 ohms. The circuit is initially at steady state 
with e(t) = 0.3 volt fort < 0. Foré¢ = 0, e(é) = 10 volts, and ast —> ~, 
u(t) — u., some constant vector. With d, = 1, the number K is the 
smallest of the four quantities: 0.58(1 — 0.968dy') X 10°°, 0.5(1.1886 — 
1.01215dy7') X 10°, 0.3815(1 — 0.583d,) xX 10°, and 0.58(1.01215) 
(1 — dy) X 10°. 

It is clear that d, must satisfy 0.968 < d, < 1 in order that K > 0. 
Then optimal choice of d, (that is, the choice that yields the largest 
value of K) is approximately 0.9709. For d, = 0.9709, K = 1.66 X 10’. 
Let the “charge switching time” t, denote the smallest value of ¢ such 
that >>?_, | u;() — ue; | is less than or equal to two percent of >°?., 
| u;(0) — uw; | for all¢ = ¢,. Then our upper bound on ¢, is approxi- 
mately 4 X (1.66)7* X 10°” = 241 nanoseconds. The actual value of ¢, , 
as determined by numerically integrating the system of two nonlinear 
differential equations is approximately 57 nanoseconds. Thus, for this 
circuit, Theorem 2 provides a very easily evaluated and useful upper 
bound ont, . 

Finally, we state a result which provides an often rather conserva- 
tive but easily evaluated lower bound on the rate of decay of tran- 
sients, 


Theorem 3: With B a constant real 2p-vector, let 
du -1 -1 
ae TFC” (u)]) + GC) = B, t= 0. 


If there exists a constant 2p-vector u.. such that [u(t) — ux] > Past — «, 
then for any choice of positive constants d; ,j7 = 1,2, +++ , 2p: 


2p 2p 
Ds d; | w;(d) — Un; | = exp (—Ki) » d; | w,(0) — Uni l, t = 0 
j=1 7=1 
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in which 
RK 1 = 1 2 - 
= see max i (1 + djd;"a,), C; > didj" | gis } 
where —a; ts the nonzero off-diagonal element in the jth column of T, and 
d; = djs; for j odd, and d, = d;-, for j even. 

The arguments used to prove the results stated in this section can 
be modified in a straightforward manner to prove far more general 
results concerning networks that contain diodes, capacitors, and in- 
ductors, in addition to the elements of the structure of Fig. 1. Some 
of these more general results are described in Section IV. 


III. PROOFS 


3.1. Proof of Theorem 1 
We first show that 


F[C™*(u.)] — F[C™'(us)] = Diu. — w), t 2 0 (12) 


and - 
Co" (ua) = Co" (us) = D,(u, — w),t 2 0 (18) 


with D, and D2 diagonal matrices dependent on ¢ and possessing some 
special properties. 

For 7 = 1, 2, +++ , 2p, let gj(ua;) = [C4(ua)]; and g;(usj) = 
[C+ (u,) ];. Then, using equation (2), 

Uss — Usg = Ci19;(Uas) — Gi(Uos)] + ri ffilgs(tas)] — filgs(ei)]i- 
Thus if u.; ¥ Us; ; 

figs) — filgi@es)] = T(Uas_» Usi) 
Uay — Un; Cy + 7 ,7j;(Uaj » Uoi) ’ 
in which (for u,; # Uy;) 
_ Filgs(tas)] — filgs(os)) 
JiUai) — gi(Uei) 


In a similar manner we find that for all u.; 4 uw; : 


1 ;(Ua; ) Uj) 


Gi(Uas) — g(tbos) = 1 , 
Uay — Ud; C; + ira; , Uri) 


Now, let us define for j = 1, 2, --- , 2p 
| 1; (Uaz  Uss) = Fig; (as) 
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when t,; = Us;. Then since u,; and u,; are continuous on [0, ©), it 
follows (see Appendix A) that 7;(u,;, u.;) is continuous on [0, ©). 
Since 7; (ua; , Us;) is nonnegative, itis clear that both 


1 (Ua; _y Us;) 
C3 + Tyr j(Uaz y Usi) 


and 


I 
C; + ria; , Us;) 
are continuous on [0, ©). Moreover equations (12) and (18) are satisfied 
with 
D, = diag {ers td 1 Us) } (14) 


Par Ti (Ua; , Us;) 


ee eS (15) 


ce 7 (Ua; , Us;) 


At this point we have 
© uy =u) + (FD, + GD Jus — ws) =Be- By, 420° (16) 


with TD, + GD, continuous on [0, @). 
We need the following lemma. 


Lemma 1*: Let M(-) be a continuous real n X n matrix-valued func- 
tion of ¢ defined on [0, ©) such that there exist positive constants ¢ and 
Cy Coy *°* » Cay With the property that for] = 1,2, ---,nandallt=0 


Miz — dD C:C;" [m:;| Be. 
Fi 


Let x be a differentiable real n-vector-valued function on [0, ©) such that 


dx 
at Mt = 9, i= 0. 


Then there exists a constant k such that fort = 1,2, ---,n,andallt = 0 
| a(t) | S k exp(—e). 


Moreover, k depends only on the c; and the initial values x;(0). 


*In Ref. 4, Rosenbrock states a similar result, but does not give a rigorous 
proof. He considers the case in which c; = 1 forj7 = 1,2, +++ ,n. 
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Proof of Lemma 1: Let the functional s be defined in terms of an 
arbitrary continuously differentiable scalar function ¢(-) by 
s(v)(t) = 1 if gf >0 or if gf) =0 and g(t) >0 
—1 if off) <0 or if gf =0 and g(t) <0 
0 if g(t) =0 and g(t) = 0. 
Then for ¢ = 0, 
2 eis(xi)(D2i(f 


I 


— Leese) Ly mises 

— Lay Les (mi 

a 3s x,C;8(x,)()m;; — d x; » cs(x,)(t)mi; 
— Lesmis fas | + DO Las | Des | ms | 
—e d | ca; |. 


I 


lA 


IIA 


But >>; ¢;s(x;) (tx! is equal to - >>; | ca; |, the right-hand derivative 


of >»; | ca; | [see Appendix B; the derivative of |x; | need not exist 
at points ¢ at which z;(t) = 0]. Therefore 


d* 
a 2 |e |S —e Dy | see |, t=0 
a i 
from which it follows that 
X leei() | < exp (et) Lee) |, 20.0 


If M(-) satisfies the conditions of Lemma 1, then it is easy to show 
that the unique continuously differentiable n X n matrix-valued function 
X defined on [0, ©) which satisfies 


<< + MX =0, t=0 [X@) = J] 
possesses the property that (for any norm || - || on Z”) there exists a 
constant K, such that 
|| X@X(r)* || S K, exp [—e¢ — 7)] 


for allt = r. 
Returning now to equation (16), assume that [7’D, + GD,] satisfies 
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the conditions on M(-) of Lemma 1. Then with Y the solution of 
dY 
ap or [TD, + GD.|Y = 0, t20 [Y(0) = I] 
we have 
u(t) — w(t) = Y(é)[u(0) — u(0)] 
+f YOY@"BG® -—Bi@ldr, +20. 
0 
Therefore, for ¢ = 0 
[| w(t) — w(t) || S || Y@[u.(0) — w(0)]|| 
+ [POLO BO — BG) Ia 
< || VOL.) — u,(0)] || 
t 
+ K, I exp [—e(t — 2)] || Bar) — Buln) |] dr 
for some positive constant K, . Since || B.(r) — B,(r) || ~~ 0as7r— @, 
it follows that || w.(é) — u(t) || ~Oast— o. 


It remains only to prove that [7D, -+ GD,.] meets the conditions 
imposed on M(-) of Lemma 1. Since G ¢ 9, there exists a diagonal 


matrix diag (d,, d., --- , do) with d; > Oforj = 1, 2, --- , 2p and 
do, 1 
sae Fa 
for k = 1, 2, --- , p such that both 


diag (d,, dz, ++: , doy)G 
and 
diag (di, do, °-: , dey)T 
are strongly column-sum dominant. Thus for 7 = 1, 2, +--+ , 2p 


ti; — Di didj* | ti; | > 0 
if? 


933 Sad: | Jii | > 0. 


tei 
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Let W = TD, + GD,. Then, for 7 = 1, 2, --- , 2p, 


1 
win = Gis C; rary TUNG ar, + ry; 


and 
_ 1 
1 = a e 
2, did; | wis | = Do didi" eG: ae TMi a op 
Therefore 


~ ys dd; " [wi |= i (t3; — De dd; " | te; |) 


ii ey hw eer ii 


ae are -1 
ot c, + tH; (9ii » d.d;* | gsi |). (17) 





Since 7; = 0, the right side of equation (17) is bounded from below by 
some positive constant « uniformly int andj. O 


3.2 Proof of Corollary 1 
By Corollary 3 of Ref. 5 there exists a unique v e E”” such that 


TF(v) + Gv = B, (18) 


whenever G is such that all principal minors of 7~’G are positive. 
In Reference 5 it is proved that T~*G will have this property if T~’G can 
be written as A~’B with both A and B stongly column-sum dominant. 

Let H = diag (d, , d2, --+ , d2»)G be strongly column-sum dominant 
with all d; > 0 and 


a? < daa. 1 
& 
z doy, a,” 


fork = 1,2, ---,p. Then U = diag (d,, d,, --- , de,)T' is strongly 
column-sum dominant, and 7~’G = U~’H, which proves that equation 
(18) possesses a unique solution v. 

With v the solution of equation (18), let u.. = C(v). Clearly if B.. = 
then u, = 6. Let uw, satisfy 


Ms + PFIC™Qu)] + G[Cu)] = t=0 


with u,(0) = u,.. Of course, u(t) = u. for allt = 0. By Theorem 1, 
[u(t) — u.] > 0ast— o, independent of wu. 
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3.3 Proof of Theorem 2 
Following the proofs of Theorem 1 and Corollary 1, 


AC — Ue) + (TD, + GD2)(u - Un) = 0, $20 


in which 
aah aie 7 ;(U;_, Uni) 

Diag ts + ryr;(u; | 
and 

a 

ee C; ae rir ju; » Un; 
Therefore 
Xd |ul) — ues | S ED a; | 450) — ues | t=0 

in which 


kK = min min or (tj; — 2 did;* | tis |), t (9:3 — » did;* | gi: pp. 
i i ti i ixy 
But for 7 = 1, 2, --+ , 2p 
t;; — Dy did;*|t; | = 1— djdj'a;. 0 
Fi 

3.4 Proof of Theorem 3 

Since 7'F[C~*(z)] + GC™*(z) depends continuously on z ¢ EL”, wu, 
satisfies (see Ref. 6) 

TF(C™*(ue)] + GC'(u.) = B. 

Therefore, following the proofs of Theorem 1 and Corollary 1, 


AG — Uo) + (TD, + GD.)(u — uo) = 0, i=0 
in which 
Dd, a diag ‘ r (Us , Uco;) \ 


po ra (u; » Usoj 
and 


; 1 
D, = ding Heat 
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For any ze EZ”, let || z || denote >» d; |z; |. Then, fort = 0 





| Se — |] = en, + eD9@ — wo I 


1 (Uz 5 Us) 


< max ia + djd;*a;) C; + rr j(ty 5 Usos) 


2p = eee es _ 
+ > d,d; | Giz Is +. r,t (U; ' co) | u(t) : Us ||. 


But, since 7;(u; , U.;) 2 0, 


rj (U; ? Uo; 
C; + Ty (U;z , Uo;) 


K 2 max {a + djd;*a;) 





2p 1 
-1 er et Meyraeer hee Ie 
a 2 a0; | a ce; + 7,rj(U; =) 














Thus 
d Ss 
|e =u < K ||u — uz |I, t2 0. (19) 
Clearly, 
d a7 oi 
| Seo |= tim? Inet 9 ~ ue — u() + we t=0 
0+ € 
Also, for ¢ 2 0, the limit 
s/o 
Lim © [|] uC) = we I= [ue + 9 = we I 


+ 


+ 
exists and is equal to — " [| « — u. || in which as before c denotes 


the right-hand derivative (see Appendix B). But, since for any e > 0 
and t = 0, 








|| w(@) — we |] — |Pueé + 6) — ue || S [lu + 6) — ue. — ult) + Ue ||, 
we have 
d* || d 
~Fiju-wils |] So —wl], PaO: (20) 
Therefore, using equations (19) and (20), 
d* . 
qilv—w[l2—Kilu-ul, #20 
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and, for? = 0, 


|| % — we || 2 exp (—Ke) || u(0) — uw, ||. O 
IV. A SIGNIFICANT EXTENSION 


We can easily extend our results to cover an interesting class of 
networks containing diodes, capacitors (not necessarily linear), and 
(not necessarily linear) inductors, in addition to the elements of the © 
Fig. 1 network. 

Let each diode be represented by a model of the type shown in 
Fig. 7 in which 


lg = _ [cava + tafava)] + fa(va), 


with cz and rz positive constants. Assume that f,(-) satisfies the con- 
ditions placed on f,(-) and f,(-) of the transistor model. Let there be 
q diodes and let v2,,, and 72,44 (kK = 1, 2, --- , g) be the voltage and cur- 
rent associated with the kth diode. 

Suppose that the kth capacitor (we assume that there are r capacitors) 
is governed by 


d : 
dt [Coprqtk(Vop+ark)] = loptetk 


for k = 1, 2, +--+ , r, where Cop; ¢4%(-) is a strictly-monotone-increasing 
continuously-differentiable mapping of E* onto itself such that co+¢+1(0) 
= 0 and the slope of ¢2,194(-) is uniformly bounded from above and 
from below by positive constants. 
Finally, let there be s inductors which introduce constraints 
d 


di [linvonpiulonseerse) | = Voptatrek 





Fig. 7— Diode model. 
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for k = 1, 2, --+ , s, in which each l.,,,:,+z(-) is a function of the same 
type as the Copiqia(°). 

Assume that the linear resistive portion of the network introduces 
the constraint 


t= —Hi+B, Be® 


. ° ~ . ° ° tr ow 

In which? = i949 ¢: Fasavetes Vaprecsir g ** y Uapdecrxe) “9 B= 
. . tr 2 

(VU, , V2» °** 5 Vopsetr > tapeatrt1 > °° * » tepeqeres) » and A is a constant 

hybrid-parameter matrix of order (20 + q + 7r-+ 8). Then 


£ (C@] + PFO) + Ho = B 


where 
(C§@], = [CO], , j= 1,2, +++ , 2p 
= ¢0; + 7:f,0)), J = 2p+1,2p+2,--- ,2p+¢q 
= ¢,(v;), dS 2p Gar Wyo epr g et 


= 1,(2;), j 2p Pg Pe lf 2p ae eet 8 


T is the direct sum of matrices T7 @ I, @ 0,.. , in which I, is the identity 
matrix of order g and 0,,, is the zero matrix of order (r + s), and 


[FO]; = [FQ)]; , g=i1,2,-+-- , 2p 
= f;(v;), j 2p = las 4 2p tg: 


Under our assumptions C(-)~* exists and, with @ = C(o), 


“ + TRC @) + HO = B. (21) 


Let © denote the set of all real matrices M of order (2p + q +7 -+ 8) such 
that there exist positive constants d,, dz, °°: , Gopsgir+, With the 
property that 

(k) z dox—1 < 1 


ay Ck) 
dor, a, 


for k = 1, 2, --- , p (when p # 0) and diag (d, , do, +++ 5 dopsqirse JM 
is strongly column-sum dominant. 

With straightforward modifications of the arguments already pre- 
sented, we can prove (2) that for each & « H°?***"*® equation (21) 
possesses a unique solution defined on [0, ©) such that (0) = & , and 
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(iz) the analogs of Theorems 1, 2, and 3 and Corollary 1. To be more 
specific, the analogs of Theorem 1, Corollary 1, and Theorem 2 are: 


Theorem 1’: If H « 9, and &, and %, are solutions of equation (21) 
with B = B, and B = B,, respectively, for t = 0, and zf [B.(é) — B,()] 
—> 6 [the zero vector of E°?***"*?] ast— © with B, e B and B,  B, then 
[a.(t) —%(0)] > Oast— ~. 


Corollary 1’: Referring to equation (21), if H « D, and tf there exists 
a constant vector B,, such that [B(t) — B,] — 0ast— o, then there exists 
a constant vector ti,, such that [i(t) — t,| — 0 ast — ~, and ti, 1s in- 
dependent of the initial condition ti, . In particular, if B, = 0, thent.,. = 9. 


Theorem 2’: If the hypotheses of Corollary 1’ are satisfied with BQ) = 
B., fort = 0, then with 4, = (2p +q+7r-+s), we have 


» d; | H(t) — uss | S exp (—Kt) Did; | 4,0) — Ga; |, t20 
= £4 
for every set of positive constants d, , dz, +++ , dopsqrrae SUCh thatO < K = 


min {K, , Ko, Ks} where 


= min min 2 (1 — d;d;’ ay), = ~ Gis — ds aid;* | gis p} 


1si82p 


la 
| 


K, = min min (t.2 — (943 — > d; d;" | gi | p} 


2pt+1S7S2pt+e 


K; = min 2 (Gee — PS dids* | gsi p} 
Qptatlsis2ptatr+s \S; Pri 
in which s; = sup ci(-) forg = 2p +qt1,-::,2p4+qtr7; 38; = 
sup Ui(-) forg = 2p +qtrt1,:::,2p+at+trt+s; —a; zs the 


nonzero off-diagonal term in the jth column of T; and d; = dj, for j odd 
and d; = d;_, for j even. Moreover there exists one such set of constants {d;}. 


V. FINAL COMMENTS 


The results presented here are quite encouraging in that they are 
concerned with the equations of reasonably realistic nonlinear network 
models, and provide some understanding of a precise nature in an 
area where there is a great need for many results of similar type. 
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APPENDIX A 


Proof that 7;(ua; , Us;) 78 continuous. 


It is clear that r;(u.; , Us;) is continuous at each point ¢ such that 
Uaz(t) + U;(t). Suppose now that ¢ is such that u,,;(t) = ws;(t), and 
let « > 0 be given. Since u,; , Us; , g and f; are continuous, there exists 
6, > 0 such that 


If) {giluaré + al} — £3 {oiluar@]} | S « 


}} 
for all | 7 | S 6,. Then for | 7| S 6, either u.;(¢ + 1) = us;(¢ + 7) in 
which case 


| ri[uas(é + 2), wast + 2)] — rilutasQ, wi®)| S 
or ta; (t + 7) ¥ w;(t + 7) and (using the mean-value theorem) 


Filgsluas(t + 0)]} — fitgilus s(t + ))} 


Paltasls Fay Mose EY = OG tual + a] — gill + a 


= fi®) 
in which 
|é — g,{u.;] | S max { | g;[uas(t + 7)] 
—gs[uas(t)] | , | giles + 0)] — gilttas(2)] | }. 


In the latter case, there exists 6. > 0 such that | f’(é) — f’ {g,[u.;(]} | 
S eforall| |S 6,. Thus for all| 7 | S min {6,, &}, we have 


| 1; [Uaz(t a n); Us; (t = n)] as 1[Ua; (2), Uy; (t)] | SV. 
APPENDIX B 


Proof that the Right-Hand Derivative of |x; | exists and is equal to 
s(x;) (xi. 


If ¢ is a point such that 2;(¢) ¥ 0, then it is clear that 
d* , 
ai | 2; | = s(z,)Oaxj(2). 

At t such that z;(é) = 0 and z/(@) ¥ 0, 


Lim aCe.) (0) BOT Sal. 


s(x) (Dx; 
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Finally if «;() = 0 and 2/() = 0, then 


O0= lim |2{(¢) | = lim le@@+o) | _ a | a; |, 
€0+ e30+ € dt 
tststt+e 


since x; is continuously differentiable. 
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Adaptive Equalization of Highly Dispersive 


Channels for Data Transmission 


By ALLEN GERSHO 


(Manuscript received May 29, 1968) 


This paper analyzes an adaptive training algorithm for adjusting the 
tap weights of a tapped delay line filter to minimize mean-square inter- 
symbol interference for synchronous data transmission. The significant 
feature of the adjustment procedure is that convergence is guaranteed for 
all channel response pulses, even for very severe amplitude and phase dis- 
tortion. 

The author examines convergence, rate of convergence, and the effect 
of noisy observations of the received pulses, and he shows that the noisy 
observations result in a random sequence of tap weight settings whose mean 
value converges to a suboptimal setting. The mean-square deviation of the 
tap weights from the suboptimal values is asymptotically bounded with a 
bound that can be made as small as desired by sufficiently reducing the 
speed of convergence. 

The suboptimality arising here results from the use of isolated test pulses 
for the training signal. However, a training scheme using pseudorandom 
sequences or the actual data signal does not suffer from the suboptimality 
effect. Hence, although of possible utility in other pulse shaping applications, 
the technique presented here appears to be primarily of value in providing a 
conceptual framework for the closely related but more practical techniques 
to be examined in the sequel to this paper to be published shortly. 


I. INTRODUCTION 


A common approach to data transmission is to code the amplitudes 
of successive pulses in a periodic pulse train with a discrete set of 
possible amplitude levels. The coded pulse train is then linearly 
modulated, transmitted through the channel, demodulated, equalized, 
and synchronously sampled and quantized. As a result of dispersion 
of the pulse shape by the channel, the number of detectable amplitude 
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levels has very often been limited by intersymbol interference rather 
than by additive noise. 

In principle, if the channel is known precisely it is virtually always 
possible to design an equalizer that will make the intersymbol inter- 
ference (at the sampling instants) arbitrarily small. However, in 
practice a channel is random in the sense of being one of an ensemble 
of possible channels. Consequently, a fixed equalizer designed on av- 
erage channel characteristics may not adequately reduce intersymbol 
interference. An adaptive equalizer is then needed which can be 
“trained,” with the guidance of a suitable training signal transmitted 
through the channel, to adjust its parameters to optimal values. If the 
channel is also time-varying, an adaptive equalizer operating in a 
tracking mode is needed which can update its parameter values by 
tracking the changing channel characteristics during the course of 
normal data transmission. In both cases the adaptation may be 
achieved by observing or estimating the error between actual and 
desired equalizer responses and using this error to estimate the di- 
rection in which the parameters should be changed to approach the 
optimal values. 

A simple and effective technique for adaptive equalization was de- 
veloped by Lucky using the tapped delay line filter structure for 
the equalizer)? The main limitation of this technique is that con- 
vergence of the tap weight adjustment algorithm is assured only for 
relatively low dispersion channels. The convergence condition re- 
quires that the dispersed pulse shape have adequate quality so that, 
in the absence of noise, error-free binary data transmission would be 
possible without equalization. In other words the dispersed pulse must 
have an open binary “eye.” 

' Using an approach to adaptation® * with virtually unrestricted con- 
vergence properties, Lucky and Rudin subsequently proposed and 
implemented an adaptive equalizer for minimizing the mean square 
error in frequency response of an analog channel.®:* This approach 
was applied to synchronous data transmission by the author and in- 
dependently by Lytle and by Niessen.*-° An implementation of the 
technique was described by Niessen and Drouilhet.*® It has also been 
implemented for data communication at Bell Laboratories. 

- In this paper the approach is used for synchronous data transmis- 
sion in a training mode where a sequence of isolated pulses is used 
as a test signal. The technique may be viewed equally as an adaptive 
design procedure for a sampled-data pulse shaping filter where the 
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error criterion is to minimize the mean square error between actual 
and desired pulse shapes at the filter output. The important feature 
of the technique is that convergence is achieved for any channel 
pulse response whatever, thereby including highly dispersed pulses 
for which even binary data transmission would be impossible with- 
out equalization. Of particular interest are: (7) the analogous optimal- 
ity condition to Lucky’s zero forcing condition resulting with the 
change from a summed absolute error to a summed squared error 
criterion,’ (2) the manner in which noisy observations introduce ran- 
domness in the iterative corrections to the weights and the resulting 
stochastic convergence properties, (iii) the possibility of applying the 
technique where isolated pulses applied to a filter must be used to 
adaptively adjust the filter for optimum pulse shaping (unrelated to 
equalization), and (iv) the conceptual framework for the more prac- 
tical adaptation techniques to be described in a sequel to this paper, 
planned for publication soon. 

Perhaps the earliest application of the tapped delay line or “trans- 
versal” filter to pulse shaping for data transmission was made by 
W. P. Boothroyd and E. M. Creamer.'! Tufts and George have shown 
that under a mean-square error criterion the optimal receiver struc- 
ture includes a tapped-delay line filter with delay between taps equal 
to the symbol period.?? 7" Aaron and Tufts have also shown that the 
same receiver structure is needed to minimize the average error prob- 
ability for binary data transmission. 

The basic approach to adaptive adjustment of a set of weights 
where a mean-square error criterion is used with a gradient search 
procedure was considered by Widrow and Hoff who noticed that no 
derivative computation is needed.? Narendra and McBride proposed 
a self-optimizing Wiener filter using a continuous-time gradient 
algorithm and a filter structure whose transfer function is a weighted 
sum of fixed functions. Koford and Groner used a mean-square error 
criterion and a gradient learning algorithm to find an optimum set 
of weights for pattern classifying.1> Widrow described a general adap- 
tive filtering problem with the tapped delay line filter.t® Coll and 
George discussed the performance of George’s optimum equalizer and 
indicated a possible adaptive adjustment technique.’ Lucky and Rudin 
were the first to apply the mean square error criterion with the 
gradient search procedure to the field of adaptive equalization.® ® 
This paper expands on a short presentation given at an international 
symposium on information theory,"® 
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II. PERFORMANCE OBJECTIVES FOR EQUALIZATION 


The objective of equalization, viewed as a pulse shaping problem, 
is to adjust the parameters of the equalizer to a setting which mini- 
mizes a suitable measure of the error between actual and desired 
pulse shapes. For the usual synchronous data transmission applica- 
tion, the desired pulse shape is one with the Nyquist property that 
the sample values y, at the sampling instant kT are given by yz = 
Sxr Where 8; is unity for k = r and zero for all other integers k. The 
criterion used by Lucky? is peak distortion, D, given by 


D= Tu Vol: 


An alternate criterion of interest is the mean square distortion £, 
defined by 


E= Dvi/y. 
kr 


The physical interpretation of the peak distortion is that it is di- 
rectly related to eye opening and determines the error probability for 
a worst case message pattern. The mean square distortion has a dif- 
ferent interpretation. If the message pattern is such that the trans- 
mitted level for each time slot is statistically independent of the levels 
for other time slots, then the variance of the intersymbol interference 
in a given time slot is proportional to the mean square distortion. If 
the pulse shape has a large number of small sidelobes so that the 
intersymbol interference is normally distributed, then minimizing 
mean square distortion is equivalent to minimizing error rate. 

Closely related to the mean square distortion is the mean square 
error 


o> 2d (yi. — d,)° (1) 


where d, is the desired pulse sample value at time instant kT. For the 
usual equalization problem where d;, = 6,, , the measure & has virtually 
the same interpretation as #; however, / is a normalized measure in- 
dependent of pulse amplitude while & depends on both shape and ampli- 
tude. Optimization of the tapped delay line equalizer with respect to 
elther criterion leads to equivalent results. 


II. FORMULATION 


Consider the transversal equalizer with N taps and tap spacing T 
equal to the symbol period. Let c; be the weight at the kth tap for 
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k = 0,1, --- , N—1so that the input output relation of the transversal 
filter at the sample times is 


N-1 


Un = Li Cln-e = CK (2) 
k=0 

where 2, and y, denote the input and output pulse samples, respectively, 
at time instants kT, c = (c) , ¢; , «++ , €y-1) is the tap weight vector, 
and X, = (, , 2n-1) °** » La-v+i) is the sample memory state of the 
delay line at the time instant nT; the vectors c and x, are to be regarded 
as column matrices, and the prime denotes the transpose. We assume 
that the input sequence 2, has finite energy. Let «¢, = y, — d, . Then 
from equation (1), using (2), the gradient of the error with respect to c 
may be written as 


Ve=2 Dam. (3) 


Therefore the optimality condition for minimum error V& = 0 is 
equivalent to the requirement that the (deterministic) corss-correlation 
between the input sequence x, and output error sequence e, must have 
zeros for the N components with index values corresponding to the index 
values of the available tap weights. That is, 


or(k) = >> etn, = 0 for k =0,1,2,---,N —1. 


This condition has an interesting similarity to Lucky’s condition 
which states that the peak distortion, D, is minimized when the error 
sequence e, has zeros for the N components with index values corre- 
sponding to the index values of the available tap weights.1 An im- 
portant distinction is that Lucky’s condition is generally not valid 
when the input pulse distortion D exceeds unity, while the mean 
square optimizing condition is valid for any input pulse with finite 
mean square distortion. 

Using equation (2), the gradient (8) can be expressed explicitly as 
a function of the tap weight vector c, namely: 


Vé& = 2(Ac — g) (4) 
where 


A= >>xxi, and g= > dx,. 


Notice that A is symmetric and positive definite (see Appendix A). 
Setting equation (4) equal to zero yields the solution for the optimum 
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tap weight vector c*, 
c*¥ = A''g. 


Using equation (2), the error expression given by equation (1) may 
be expressed in the convenient form: 


&(c) = &(c*) + (c — c*)’A(e — c*) (5) 


which shows explicitly the simple quadratic nature of the error surface 
and the unique optimality of the minimizing weight vector c*. It can 
be shown that the residual error &(c*) can be made as small as desired 
for all channels of practical interest by using a sufficiently large number, 
N, of taps.” 

It is intuitively reasonable that successive corrections to the tap 
weight vector in the direction of steepest descent of the error surface 
should lead to the minimum error where c = c*. This is the idea of the 
well-known” gradient algorithm: 


Ci4. = C3 ZzaV &(c;) (6) 


where a is a suitably small positive proportionality constant, Cy is 
arbitrary, and c; is the tap weight vector after the 7th iteration. 

The significant feature of the gradient algorithm for our quadratic 
error surface (5) is that the gradient can be conveniently evaluated 
without knowledge of the error surface itself. We have seen from equa- 
tion (8) that the components of the gradient vector are values of the 
crosscorrelation between the input sequence and the output error 
sequence. This suggests the conceptually simple implementation 
where an isolated test pulse is transmitted through the channel and the 
requisite crosscorrelation values are formed by multiplying the de- 
layed input pulse with the error pulse, sampling, and summing (or 
averaging). The tap weights are then incremented according to (6), 
the old crosscorrelation values “dumped” and a new iteration is begun 
with the transmission of a new test pulse. 


The error pulse is formed by subtracting from the equalizer output 
pulse an “ideal” pulse whose sample values are the desired values d, ; 
the ideal pulse is locally generated at the appropriate time. The basic 
scheme is shown in Fig. 1. Naturally, the summation given by equation 
(3) cannot be performed over an infinite time interval. Suppose «7 
is a practical upper bound on the possible time duration of the input 
pulse, 7 is the time interval between successive test pulses with 7 >xT, 
& and x as positive integers. Then if we include the effect of perturbing 
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Fig. 1 — Four tap training mode adaptive equalizer. 


receiver noise samples n; and z; at the equalizer input and output, 
respectively, the measured crosscorrelation vector @; after the ith 
iteration is given by: 


lottitk 


a= DD (Xi-ze + mi)(er-sg + 21). (7) 
T=Ioti 
In the noiseless case the estimate 6; reduces to one-half the deterministic 
gradient, that is, }V&(c,;) under the assumption that the pulse sequence 
x, and desired sequence d, are virtually zero outside of the interval 
Lslskht«e—N+1. 


IV. CONVERGENCE PROPERTIES 


In the presence of noise the tap weight corrections contain undesired 
random components consisting of products of input and output noise 
samples and products of pulse and noise sample. As a result, the 
random tap weights no longer converge to the optimal values but 
instead approach some neighborhood of a suboptimal setting and then 
fluctuate randomly about this setting. The error between the optimal 
and suboptimal settings is small for low noise levels and decreases 
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with increasing signal-to-noise ratios. The size of the fluctuation neigh- 
borhood about the suboptimal setting is proportional to the noise level 
but can be made as small as desired by making the training time suf- 
ficiently long. | 

Assume the noise samples n; have zero mean and finite variance o’. 
Define the vector ny = (n,, M41, °** » Me-w 4.1) to be regarded as a col- 
umn matrix. Then the output noise samples of the equalizer are: 


2, = cn. (8) 


Define the matrix B = E(n,n/), where H(---) denotes the expected 
value. Notice that B is symmetric and positive semidefinite. 

To formulate the iterative equations describing the tap weight be- 
havior in the presence of noise, apply equations (2) and (8) to (7) to 
show how the gradient estimate depends on the tap weight vector: 


o; = > (X:-¢¢ + m2) [(Kr-ve + 1)’c; — dr-i:]. 


1 


Hence 
@ = He—-g-v., (9) 
where H, is the random symmetric matrix 
Hi, = > (Xr-sg Hi) (Ki-sg + 1)’ (10) 
and 
v; = den Ga (11) 


Let @ = E(H;), the expected value of H; . Then equation (10) yields 
@=A-+ xB. (12) 


which is positive definite since A is positive definite and B is positive 
semidefinite. 

It is convenient to examine the random variation of the tap weight 
vector c, about the suboptimal setting defined by 


c= @'g, (13) 


and letq; = c; — ¢. From equation (12) it is evident that the suboptimal 
setting € approaches the optimal setting c* as the ratio of noise variance 
to input pulse sequence energy approaches zero. The iterative algorithm 
may be expressed in the form 


Git = Qi — ao; ) (14) 
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6; = Hq; +h; (15) 
where 
h, — H,¢é ae g SS Vises (16) 


Equations (14) and (15) constitute a system of first-order stochastic 
difference equations with a forcing function h; which is statistically de- 
pendent on the stochastic state matrix H; . We assume that the per- 
turbing noise samples in different iterations are uncorrelated, so that 
H; and h; are independent of H, and h; for z ¥ 7. Notice that the ex- 
pected value of any function of H; and h,; is independent of +. Under 
these conditions it is proved in Appendix C that for suitably small values 
of a the mean value of the solution vector q; approaches zero as 2 — © 
and the sum of the variances of the components of q; is bounded with a 
bound that approaches zero as a approaches zero. Consequently the 
mean value of the tap weight vector converges to the suboptimal set- 
ting € while the actual tap weights fluctuate randomly about the con- 
verging mean values with a variability that can be made arbitrarily 
small. 

Notice from Appendix C that the norm of the mean solution vector 
(q); is reduced at least by the factor ¢, the spectral norm” of I — a@. 
Let p, and py denote the minimum and maximum eigenvalues, respect- 
ively, of @. Then 


¢ = min|1—ap,|, |1— apy|. (17) 


(For proof see p. 24 of Ref. 20.) 

Then for 0 < a < 2/(p, + py), we obtain ¢ = 1 — ap, . Consequently, 
while decreasing a offers a smaller bound on variability of the tap weight 
vector, increasing a assures a stronger bound on convergence rate. For 
the training mode it is likely that speed of adaptation will be relatively 
unimportant so that a very small value of a could be used to approach a 
tap weight setting that is very close to the suboptimal setting. 

It is useful to obtain bounds on the eigenvalues of @ which can be 
determined without specific knowledge of the channel characteristics. 
If x(t) denotes the channel pulse response and n(t) the additive receiver 
noise so that the sampled values used earlier are given by 2, = x(kT) 
and n, = n(kT), then the sampled spectrum X*(w) of 2, is 


Xe) = Det = ; © X(w — 2/1) 


and the sampled spectral density S*(w) of 7; is 


$+) = L Eomde i = 7 D Slo - 24/7) 
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where X(w) is the Fourier transform of x(t) and S(w) is the spectral 
density of n(é). Let m and M denote the infimum and supremum, 
respectively, of | X*(w) |? + «S*(w) so that 


m < | X*(w) + eS*(w) <M. (18) 


In all cases of practical interest 1 will be finite; furthermore generally m 
will be greater than zero. It is shown in Appendix B that each eigenvalue, 
p; , of @ will be bounded according to 


mSp,SM. (19) 


To illustrate the use of this bound, notice from Appendix C that the 
condition for convergence of the mean tap weight vector to the subopti- 
mal solution is that a < 2/py . Thus a sufficient condition is that 


a<2/M. (20) 


Furthermore, the mean tap setting converges exponentially with the 
convergence factor ¢, given by equation (17). Hence it can be inferred 
that the choice of a which provides the strongest bound (least value of 
§) is @ = 2/(p: + py) yielding 

_pat 
where p = py/p, . Using the bounds given in (19) we obtain p S M/m, 
and so 


(sm. 
~M+m 


Therefore, for the best choice of a, convergence of the mean proceeds at 
least at a rate given by the geometric factor (MZ — m)/(M + m). Thus 
useful information regarding the convergence speed can be determined 
without knowledge of the channel characteristics. 


(21) 


Vv. CONCLUSION 


The degree of suboptimality of the tap weight setting reached by the 
training algorithm may or may not be consequential, depending on the 
application. In applications where multilevel pulse transmission with a 
large number of levels could be achieved with adequate equalization, 
the signal-to-noise ratio is necessarily very high and therefore the degree 
of suboptimality is not large. Even when the noise level is fairly sub- 
stantial the suboptimal setting may still be adequate if the error surface 
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given by &(c) is “shallow” in a large neighborhood of the minimum. 
Then a fairly large departure of € from c* may correspond to a rela- 
tively small increase in mean-square error. Also, if training mode adapta- 
tion is used as a prelude to tracking mode adaptation, a fairly large 
degree of suboptimality may be a tolerable starting point for a tracking 
mode operation such as the one we plan to describe in a future paper. 

When the noise level is substantial the criterion for optimality used 
here becomes inadequate because it does not consider the effect of the 
equalizer on the receiver noise. The price of reducing intersymbol in- 
terference may be a sizable increase in noise level at the equalizer 
output. In our future paper the error criterion is modified to include 
noise with the result that the problem of suboptimality does not arise. 

The random fluctuation of the tap weights which prevents true con- 
vergence to the suboptimal setting can be eliminated by reducing the 
proportionality constant @ in each iteration using a sequence of step 
sizes a, with the properties 


>a = © and ye ee 

It may then be shown that the tap weight vector converges to the 
suboptimal solution with probability 1. The proof uses stochastic 
approximation theory and follows the lines taken by Tong and Liu 
who considered a training mode algorithm for low dispersion chan- 
nels.2* However, this modification complicates the implementation 
somewhat and cannot be applied to the tracking mode adaptation 
problem. 


APPENDIX A 
Proof that A is Positive Definite 
The matrix A is defined by 


A= >> xx. (22) 


k=—o 


Consequently 
Ac = >> c’/x,xle = iyi. 


But the sequence y; is the convolution of the x, sequence with the finite 
tap weight sequence c, . Hence, using Parseval’s equality, 


c’Ac = = -f | X*(u) f° | Cl) [P do, (23) 
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where 
N-1 . | 
Cw) = ae 
k=0 


Equation (23) shows immediately that c’Ac is nonnegative for all 
vectors c. Also, C(w) can have only isolated zeros and | X*(w) | is square 
integrable since the input pulse has finite mean square distortion. It may 
then be inferred that c’Ac > 0 unless c = 0, which proves that A is 
positive definite. 


APPENDIX B 


Bounds on the Eigenvalues of @ 


Since B = /(n,n/) the quadratic form c’Bc is the mean squared value 
of y; of the response of the equalizer with weight vector c to the input 
noise n; . Consequently 


Be = 2 [ g*@) | Cle) | de. (24) 


2a —1/ 
Combining equations (23) and (24) yields 
n/T 
ctec= sf {| X*@) FP +x8*@)} [CW Pde. 25) 
2a —1/T 
Applying to equation (25) the bounds m and M given by equation (18) 
yields 
me’c S cac S Me'c. (26) 


Let c be the eigenvector of @ corresponding to eigenvalue p. Then 
@c = p;c and equation (26) yields 


msp5M (27) 


which provides a convenient bound for the largest and smallest eigen- 
values of @. 


APPENDIX C 


Convergence Proof 


To examine the convergence properties of the tap weight adjustment 
algorithm, it is convenient to define the norm of a random vector u as 


ul] = wy)”, (28) 
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so that the squared norm of u is the sum of the second moments of the 
components of u. For a deterministic vector the norm reduces to the 
usual Euclidian norm. The norm of a deterministic matrix will denote the 
usual spectral norm.” 


Theorem: Let Hy, be a sequence of random symmetric N X N matrices 
and h, a sequence of random N-tuple column vectors. Suppose H, and 
h, are stationary in k with H, and h, independent of H; and h, fork # 7. 
Assume h, has zero mean, and the elements of H, and h, have finite vari- 
ance, HH, = @, independent of k with @ positive definite. Define the random 
vector sequence q, according to: 


Gi+1 = Gz — AQ, (29) 
where 
Oo. = Hq, +h, (30) 
fork = 0,1, 2, --- and qo ts an arbitrary deterministic vector. Then for a 
positive and sufficiently small, 
lim || Ziq, || = 0 (81a) 
k-00 
and 
lim sup || ae || S$ V@) (31b) 
k-00 
with V(a), given in (47), satisfying: 
lim V(a) = 0. (32) 
a0 
Proof: Combining equations (29) and (80) yields 
Qrs1 = (I — aHy)q, — ch, . (33) 


Noting that q, is independent of H, , taking the expected value in 
equation (33), we find 


EQ) = I — e@) EQ). (34) 
It follows then that 
| Za) || <= || Fw || _ (85) 


where 
c= ||I-ae@||. (36) 
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Hence equation (31a) follows when ¢ < 1, or equivalently, for 
0<a < 2/py (37) 
where py is the largest eigenvalue of @. 
To prove equation (31b), observe that 
BQtsdees) = Elaid — oH;)’a.) — Eoai(T — oH,)hi] + o? || hy ||? 
(38) 
from equation (33). Noting again that q, is independent of H; , we have 
Ela — oH,)’q.) = E{qiE((I — oH,)Ja.} Sv || a II, (39) 
where 
u = || E[Z — eH,)71 ||. (40) 
Also, using the Schwarz inequality, 
E(—qidl — oH,)h,] = aqiE(Hh,) S @ || a || f 
where f = || #(H,h,) ||. Using equation (85) we obtain 


—Elq.. — oH,)h,] S ag | Bo) |I f- (41) 
The bounds (39) and (41) may be applied to equation (38), yielding 
ees (I? Se [lage ll? + ef || Bao ll +o? |b. |]?. (42) 


If we now define the bounding sequence of positive numbers Q,; ac- 
cording to 


Qo = || Eq ||° 
and 
Ques = uQ;, + af {| E(q) | * + a ||h, ||? ’ (43) 
then it follows from (42) that 
lla. |? SQ. 


But the difference equation given by (43) has the asymptotic solution 


2 2 
lim Q, = @ lel 


’ 


k=*00 — #-E 
for £< land » < 1. Then 
2 2 
Lim sup {| (P= SL BeIE (44) 
ke 
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Notice that || h; || is independent of k by the hypothesis of station- 


arity. 
Since 
(I — eH,)? = (I — ae)? + ao’ E(G?). 
where 
G, = H, << a, (45) 

we find that 

nS || 1 —a@ |P +o? || EGP || “igs 

eS +a’y 
where y = || Gj ||. Furthermore for a < 2/(p; + py), we have § = 1 — 
ap, . Then, using (46), we see that 

a a 





< SS ST - 
1 — p= 2ap, + a(p: + 7) 


We have therefore shown that for positive and sufficiently small a, 
equations (1b) and (32) are valid where 


V@ = (47) 


a 
2p: + a (pi + 7) 
REFERENCES 


1. Lucky, R. W., “Automatic Equalization for Digital Communication,” BS.TJ., 
44, No. 4 (April 1965), pp. 547-588. ; 

2. Lucky, R. W., “Techniques for Adaptive Equalization of Digital Communica- 
tion Systems,” B.S.T.J., 45, No. 2 (February 1966), pp. 255-286. 

3. Widrow, B., and Hoff, M. E., Jr., “Adaptive Switching Circuits,” IRE Wescon 
Conv. Record, Pt. 4 (August 1960), pp. 96-104. 

4. Narenda, K. 8. and McBride, L. E., “Multiparameter Self-Optimizing Sys- 
tems Using Correlation Techniques,” IEKE Trans. Automatic Control, 
AC-9, No. 1 (January 1964), pp. 31-38. 

5. Lucky, R. W. and Rudin, H. R., Generalized Automatic Equalization for 
Communication Channels”, Proc. IEEE (Letters), 64, No. 3, pp. 439-40, 
March 1966. 

6. Lucky, R. W. and Rudin, H. R. An Automatic Equalizer for General-Purpose 

near Channels, B.S.T.J., 46, No. 9 (November 1967), pp. 2179- 

. Gersho, A., unpublished work. 

. Lytle, D. W., unpublished work. 

. Niessen, C. W., “Automatic Channel Equalization Algorithm,” Proc. IEEE, 

55, No. 5 (May 1967), p. 689. 

10. Niessen, C. W., and Drouilhet, P. R., “Adaptive Equalizer for Pulse Trans- 
mission,” Conf. Digest, IEEE Int. Conf. on Commun. (June 1967), p. 117. 

11. Boothroyd, W. P. and Creamer, E. M., Jr., “A Time Division Multiplexing 
System,” Trans. AIEE, 68, Pt. I, (1949) pp. 92-97. 

12. Tufts, D. W., “Nyquist’s Problem—the Joint Optimization of Transmitter 


oan 


70 


13. 
14. 


15. 


16. 


17. 
18. 


19. 


20. 
21. 
22. 


THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1969 


and Receiver in Pulse Amplitude Modulation,” Proc. IEEE, 63, No. 3 
(March 1965), pp. 248-259. 

George, D. A., “Matched Filters for Interfering Signals,” IEEE Trans. In- 
form. Theory IT-11, No.1 (January 1965), pp. 153-154. 

Aaron, M. R. and Tufts, D. W., “Intersymbol Interference and Error Prob- 
ie IEEE Trans. Inform. Theory IT-12, No. 1 (January 1966), pp. 

Koford, J. S., and Groner, G. F’., “The Use of an Adaptive Threshold Ele- 
ment to Design a Linear Optimal Pattern Classifier,’ [EEE Trans. Inform. 
Theory, JT-12, No. 1 (January 1966), pp. 42-50. 

Widrow, B., “Adaptive Filters I: Fundamentals,” Stanford Electronics Lab., 
Stanford, Calif., Report SEL-60-126 (Technical Report 6764-6), December 
1966. 

Coll, D. C. and George, D. A., “A Receiver for Time-Dispersed Pulses, Conf. 
Record, 195, IEEE Annual Commun. Conv., pp. 753-757. 

Gersho, A., “Automatic Equalization of Highly Dispersive Channels for Data 
Transmission,” Int. Symp. on Inform. Theory, San Remo, Italy, September 
1967. 

Gersho, A. and Freeny, S. L., “Performance Capabilities of Transversal 
Equalizers for Digital Communication,” Conf. Digest, IEEE Int. Conf. on 
Commun. 1967, p. 88. 

Goldstein, A. A., Constructive Real Analysis, New York: Harper and Row, 
1967, Chap. 1. 

Tong, P. S. and Liu, B., “Automatic Time Domain Equalization in the 
Presence of Noise,” Proc. Nat. Elec. Conf., 23, (October 1967), pp. 262-266. 

Varga, R. S., Matrix Iterative Analysis, New York: Prentice-Hall, 1962. 


On the Probability of Error Using a 
Repeat-Request Strategy on the 
Additive White Gaussian Noise Channel 


By A. D. WYNER 
(Manuscript received July 5, 1968) 


An upper bound on the error probability is obtained for digital communi- 
cation (with average power P, and no bandwidth constraint) in the presence 
of additive white gaussian noise (with one-sided spectral density N,) with 
the use of a noiseless feedback link. A repeat-request strategy is used: the 
receiver decodes a signal only when it is relatively sure that one particular 
message was actually transmitted, otherwise it requests (via the feedback 
channel) a retransmission. We show that as the coding delay T becomes 
large, we can transmut at an effective rate R < C = P,/N,, the channel 
capacity, with error probability P, approximately exp {—T|(VWC — 
VRP +C = R]}, which is a considerable improvement over the reliability 
attainable with a one-way channel. These results parallel those obtained 
earlier by Forney for the discrete memoryless channel. 


I. INTRODUCTION 


In a recent paper, Forney studied a repeat-request strategy for 
communication of digital information over a discrete memoryless 
channel when a fedback channel is available.? In this system the 
receiver decodes a received message only when it is relatively “sure” 
that one particular message was actually transmitted. If the receiver 
is not confident that one particular message was actually transmitted, 
then it requests (via the feedback channel) that the transmitter repeat 
the message. Forney showed that considerable improvement in the 
resulting error probability (over the best one-way scheme) was ob- 
tainable with a negligible degradation in the effective rate of trans- 
mission. In this paper we apply Forney’s ideas to the additive white 
Gaussian noise channel (with no bandwidth constraint) and obtain 
analogous results. Furthermore, our coding scheme is constructive— 
the codes being orthogonal codes. 
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We will consider the following channel. The channel input signal 
is a real-valued function s(t), defined on the interval [0, 7], which 
satisfies the “energy” constraint 


/ 21) dt = PrP. (1) 


The average signal ‘‘power” is therefore P,. The channel output r(é) 
is the sum of s(t) and a sample n(é) from a white Gaussian noise process 
with one-sided spectral density N, (and with mean zero). By expanding 
s(t), r(t) and n(é) on any orthonormal basis of £.[0, J], it is easy to 
show that an equivalent channel model is as follows.*”® (This equivalent 
channel model is the one we use in this paper.) The input signals are 


are (semi-infinite) vectors x = (x, , %2, ---) which satisfy 
t-) 
> 22 = AT. (2) 
k=] 

The channel output is a vector y = (y;, Y2, °** ), where 


Y,= +e, k=1,2,-:-, 


and the z,(k=1, 2,---) are independent Gaussian variates with zero mean 
and unit variance. The parameter A is equal to 2P,/N,, and we as- 
sume that A is held fixed throughout the paper. We also assume that it 
takes TI’ seconds for the channel to process x, and that successive T- 
second transmissions are independent. 

A code with parameters M and T is a set of M signals (called ‘‘code 
vectors” or ‘‘code words”) x; = (#1, ti2, °°*), 7 = 1,2, --: , M, 
which satisfy equation (2), that is 


Ya, = AT, *=1,2,-°-,M. (3) 


We assume that each of the M code words is equally likely to be trans- 
mitted, so that the transmission rate is R = 1/T \In M nats (natural 
units) per second, and M = e””. It is the task of the receiver to examine 
the channel output y and to announce the code word, say D(y), which 
it believes was actually transmitted. Let P.; be the probability that 
D(y) # x; given that x; is transmitted. The overall error probability 
is therefore | 


1 M 
P, =a Se P ik 


t=1 


It is easy to show that for a given code, the “optimal” decoding rule D 
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(which minimizes P,) selects for D(y) that code word x; which maximizes 
(with respect to 2) the inner product 


X;,y) = 2 LinYr 


Define P*(M, T) as the smallest attainable error probability P, for 
a code with parameters M and 7. Set M = [e””], and let T — © with 
the rate & held fixed. Then it is well known that if R < A/2 = 
P,/N, = C, the “channel capacity,” 


P%((e*"], T) = exp {-E(R)TU + (T)]}, (4) 


where E,(R) > 0, and e(T) > 0 as T — o.''*° Thus at rates R < C, 
the error probability tends to zero exponentially in 7’. Further, for rates 
R > C, P*({e”7], T) — 1, so that the capacity C is the supremum of 
the rates for which ‘‘error-free’’ coding is possible. 

Although this type of behavior of P* is typical of a large class of 
channels, the present channel is unique in two ways. First the exponent 
E(B) is known exactly, namely 


C/2 — R, O<RESC/, 


< 
E(B) = 
[(Ci-RP?, C/A SREC. 


(5) 





Second, an explicit construction of codes which achieve error probability 
as in equation (4) is known. In fact, P, as in equations (4) and (5) 
can be achieved when the code is any set of M orthogonal vectors. 
The simplest such code is that for which z,, (the kth coordinate of x;) 
is given by 


(AT)*, k = 3, 
0, k x 4, 


7=1,2,---,M, k=1,2,---. (6) 





For this orthogonal code, the inner product of y and the 7th code word is 
(x;, y) = y(AT)}, 7=1,2,---,M; 
so that the optimal decoding rule is 
Dy =x, if yi> Y¥; for all 7 ¥ 2, lsjsmM. (7) 
With probability one, (7) is satisfied for exactly one 7. Notice that 
the coordinates y;(j > M) are irrelevant to the receiver. Further, 


from the symmetry of the orthogonal code (6), we can without loss 
of generality, assume that code word x, is transmitted. Hence, the 
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error probability is 


M 


Pe=Pa=PrU tu Sui}, (8) 
where the probability is computed with {y;}" independent unit variance 
Gaussianrandom variables with Hy, = (AT)* and Ey; =0(2 <j M). 

Now suppose we can use a noiseless feedback link. As before, we 
transmit one of a set of M = e”” orthogonal signals {x;}", where x; is 
given by (6). Instead of the decoding rule (7), let us use the rule 


Dy) =x, ify >y;, +A foralj Az, LSjSM, (9) 


where A > O will be chosen later. If no y; satisfies (9) then we 
request a retransmission via the feedback channel, and use (9) on 
the second received vector, and so on. The probability of error de- 
creases as A increases. The price which we pay for this increased reli- 
ability is an increase in the length of time which it will take to complete 
the transmission of the M-ary message, and the consequential reduction 
in the effective rate of transmission. In fact, let EZ, be the event that we 
ask for a retransmission, and let P(£,) be its probability. Then from 
the assumption that successive transmissions are independent, the 
expected number of T-second transmissions required to accept a 
message is 


>. j Pr {j transmissions are required} 


7=1 


¥ i — PwEqPway = 1 — Pw) Y Pee 


1 1 
iL Ee Pao b= Pe) 


Thus the average length of time required to transmit the /-ary message 
is T = T/(1 — P(E,)). If P(E) is small, then 7 is not much greater 
than 7’. 

Suppose that we use this repeat-request strategy repeatedly—that is, 
if the receiver does not call for a retransmission, then the transmitter 
sends a new M-ary message. lor k = 1, 2, --- , let the random variable 
N,, be the number of M-ary messages which the receiver accepts (that is, 
it does not call for a retransmission) in kT seconds. Then we can write 


ONS 2285 
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where the random variables é; = 1 if the receiver accepts a message 
on the jth T-second interval, and €; = 0 otherwise. Note that Pr{é; = 
0} = P(H£»), and that the {é},", are independent (since we have as- 
sumed that successive 7'-second transmissions are independent). Thus 


(1) E(N;,) = kE(&) = kd — PEs)) 
(ii) N,/k—>1— P(Er), ask, (10) 
with probability 1. 


Statement (iz) follows from the strong law of large numbers (see Ref. 
3, p. 190). Since each M-ary message contains In M = RT nats, the 
effective rate of transmission R, in the light of (10), 


5 _ [E(N,)|kT 
R= —*—+— nats/sec 
kT (11) 


= Ril — P(E,)] = R(L/T). 


Let us turn our attention to the probability of error. Since we are 
using the orthogonal code of (6), we can, as above, without loss of 
generality, assume that code word x, is transmitted. Using the decod- 
ing rule of equation (1.9) we make an error only when for some j > 1, 
y; > y; + Aforalli = 1,2,---, Mand ¥ j. (In this case D(y) = x;.) 
Thus the error probability is 


AML 
P.=Pr Uf) fyi > ys + A}. (12) 
7=2 iX7 
As in (8), the probability in equation (12) is computed with Hy, = 
(AT)? and Ey; =0(2 Sj < M). 
Let us further define , as the event that either an error occurs or a 
repeat-request occurs. If x, is transmitted, H, has probability 


M 
Pr (Z,) = Pr U (fy Sy; + A}, (13) 

7=2 
where as above, the probability in (13) is computed with Fy, = (AT)? 
and Ey; = 0,7 > 1. Clearly the probability of a repeat-request is 
P(E) = P(E,) — P. S P(#)). (14) 
Consider the parameter A. In the interest of minimizing P,, we want 
to make A large. However, in the interest of minimizing P(E,) and 


therefore making R as close to & as possible, we want to make A small. 
The approach which we will take is to choose A just small enough so 
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that as the parameter JT — © (8 is held fixed), P(Z,) — 0; so that 
by (14), P(Zz) — 0. Thus the effective transmission rate R ~ R. 
We will see that this results in a considerable improvement in P, 
over that of equations (4) and (5). Roughly speaking, we will show 
that the resulting exponent is increased from that in equation (5) to 
approximately 


E,(R) = [(C? — RP3P+C — R = 2c%(c3 — R. (15) 
The exponents E£,(R) and E;(R) are plotted in Fig. 1. Notice that the 


improvement is greatest in the neighborhood of capacity where (as 
R — C)E;(R) & (C — R) and E,(R) ~ (C — R)’/4C. 


II. SUMMARY AND DISCUSSION OF RESULTS 


_ The main result is given as a corollary to the following two theorems 
which provide information on the trade-off between P, and P(E,) as 
A is varied. The proofs are given in Section IIT. 


Theorem 1: Let {y;}i", be independent Gaussian random variables with 
unit variance and expectation 


ky, = (AT), (16) 


2c 


15C 











' Fig. 1— Exponents for white Gaussian noise channel: Eo(R)-one way exponent, 
Er(R)-répeat-request exponent. 
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Let M = e®", whereO0 < R < A/2 = C, and let A = 8(27)', where 
Cc? — (4R)' s 6 < ct — Ri. (17) 
Then 


M 
P(E,) = Pr U {yw Sy; + A} S2 exp (-[C?- RP aT}. (18) 
j=2 
Notice that 6 = 0 will satisfy (17) if R = C/4. In this case P(H,) = P, 
(see (8)), and (18) yields E,(R) = [C? — R3P(C/4 S R S CO), a fact 
which is contained in (5). In fact, the proof of Theorem 1 closely 
parallels the derivation of P, for orthogonal codes (for a one-way 
channel). 


Theorem 2: Let {y;}", be independent gaussian random variables with 
unit variance and expectation 


Ey, = (AT), (19) 
Ly; = 0, 2S75M. 
Let M = e®7, where 0 S R < A/2 = C, and let A = 8(2T)', where 
5> Cc? — (4R)}. (20) 
With RB and 6 held fixed, and 6, , 0. arbitrary but satisfying 
6, > 0, (21a) 
0<<|* (21b) 
5 — [C* — (4R)*] 
aca arn. 


then for T sufficiently large, 


Pr ON) ty; > ys + 4} 
g=e2 ii (22) 


2(1 + 6,) exp {—[(R? + 6 — 6)? + (C? — R? +4 6,)? — RIT}. 


Again notice that 6 = 0 will satisfy (20) if R > C/4. In this case 
also, (22) yields H,(R) = [C* — R'*), when R > C/4 (since 6 can 
be made arbitrarily small). 

Let us now use these theorems to find the value of A = 6(27)* which 
gives the smallest upper bound on P, without substantially changing 
the effective rate R = R[1 — P(L,)]. Since P, is a decreasing function 
of 6, we choose 6 as large as possible with the proviso that P(E,) — 0. 


P, 


IIA 
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From Theorem 1, this value of 6 is 


where y, > 0. If y: is sufficiently small, this choice of 6 satisfies (17) 
and (20). With 6 so chosen, for any y. > 0 we can find a T sufficiently 
large so that R = R(1 — yz). Further, substitution of equation (23) 
into equation (22) yields an exponent 


—[c? arene (ane 02)” + (Cc? —* R} + 65)” a R)T. 


Finally, since 7; , y2, 6; and @ can be made arbitrarily small we have 
our main result: 


Corollary: Let 6; > 0, € > O be arbitrary. Let R < C. Then for T 
sufficiently large, there is a repeat-request communication system using 
orthogonal codes with an effective rate of R and error probability 


P, S 2(1 + 6) exp {-[(C? — BR)? +C — R — T}. 


Let us turn our attention to (4) and (5) which give the error prob- 
ability for the one-way Gaussian channel. The fact that £,(R) S 
(Cc? — R})? can be demonstrated by a “sphere-packing” argument.° 
This argument states that P*(M, T) = Q, where Q is the probability 
of error which would result if it were possible to subdivide Euclidean 
M-space into M congruent cones (each with apex at the origin), one 
for each code word, and each code word were placed on the axis of its 
cone at a distance (AT)* from the origin. Setting the “sphere-packing 
exponent” 


E'sp(R) = (c? rae He), 


we have from the above corollary that for effective transmission rates 


R < C we can obtain an error exponent arbitrarily close to 
E;(R) = Esp(R) a Ot R. (24) 


For discrete memoryless channels it is possible to find a lower bound 
to the optimal (one-way) error probability using an analogous sphere- 
packing argument.” Forney showed that using a repeat-request strategy 
similar to the one used here, one can obtain an error exponent arbitrarily 
close to that of equation (24) [with the appropriate F'sp(R)].° Forney 
also studied the so called (discrete) ‘‘very noisy channel,’’ which is 
closely related to our Gaussian channel* and obtained results similar 


* Our Gaussian channel may be thought of as a “very noisy channel” since 
the signal-to-noise ratio per coordinate is zero. 
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to our results. Thus, in the light of Forney’s results, the above corol- 
lary is not surprising. 

Let us also remark that Kramer has found a scheme for our white 
noise channel with a feedback link that attains an error exponent of 
C — R, which is less than that in equation (24).* In Kramer’s scheme, 
the receiver observes the signal until it is sufficiently confident that 
one particular message was actually transmitted. It then informs the 
transmitter, via the feedback channel, to start the next M-ary trans- 
mission, thereby using the feedback channel only once per M-ary 
message. In the repeat-request scheme studied here, the number of 
uses of the feedback channel per M-ary transmission is an unbounded 
random variable. Thus the two schemes, while similar (in that the 
feedback channel is used only to convey a “decision”), are not di- 
rectly comparable. On the other hand, there are schemes which use 
the feedback channel considerably more heavily (so called “informa- 
tion feedback”) which in some cases attain somewhat better per- 
formance than the repeat-request strategy. (See for example Refs. 5, 
6, and 10). 

Finally, an important problem which has been completely ignored 
here is the requirement that the transmitter have a buffer in which it 
can store data which will accumulate at the transmitter at times when 
the receiver asks for retransmissions. If the buffer has finite capacity, 
it will occasionally overflow, introducing a further source of errors. 
Some quantitative results on this problem have been obtained by the 
author, and will be reported in a future paper. 


III. PROOFS OF THEOREMS 


We begin with some definitions. Let 
g(a) = Gap ex (—a"/2), —@ <a<o, 
be the standard Gaussian density, and let 
eu) = [| g@da, -» <u<o, 
be the cumulative error function, and let 
2) = [ g(e)da=1-W), 2 <u<e, 


be the complementary error function. Let b = (AT)? = (2C T)? so 
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that y, has density g(a — b) and y; (2 S 7 S M) has density g(a). 
We will use the following 


Lemma 1: Foru = 0,®,(u) S exp (—u’/2); and for u S 0, &,(u) S 
exp (—u’/2) (Wozencraft and Jacobs Ref. 8): 


Proof: For u = 0, 


(econ = [°° oe@a@ dads s ff o(edo(@) da dp = SP) | 


where ® = {(a, B): a’ + 6’ = 2u’,a = 0, B = O}. Taking square roots, 
we have 


&(u) < exp (Kv?) < exp (—u?/2). 


The rest of Lemma 1 follows on noting that @(u) = ®,(—u). 


Proof of Theorem 1: Let R (0 < R < C) and 6 satisfying (17) be 
given. Since y; has density g(a — b), and the {y;}}’ are independent, 


P(E,) = Pr U {y; 2.4: — A} 


* t least one 
=] d —»d)P o 
im z glo ) x Y; = Yi — A 





Y= a} (25) 


= [| dag) Pr ly; 2 a— Al. 


7=2 


Now since the y; (j > 1) have density g(a), 


M 1 
Pr UY ty 2a — A) a eee < M&(a— A). 
(26) 


Letting a be a parameter to be specified Jater, we break the integral of 
equation (25) into two parts, a S a and a 2 a. We then apply the first 
upper bound of (26) in the first part, and the second bound of (26) 
in the second part. Thus 


P(E, < a De eas l Fe CN 


If we assume that 
A, (27) 


rs) 
IV 
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we can use the bound of Lemma 1 on ®,(a — A) and obtain 


P(E, < [. Fe ee i joni ae anes 
= P, + MP. (28) 


We now overbound P, and P, . First, 


Pi=[ ga-dda=f 


If we further assume that 


g(a) da = P(a — Db). 


ab, (29) 
we can use Lemma 1 and obtain 
P, S exp [—(b — a)’/2]. (30) 


Second, 


P= | oy exp [—#(@ — 0)*] exp [—H(a — A)"] da 


= [chew |-(¢- 244) ] ew (0 - 9/4 ae 








exp LAA), Le. ie 
+ /2 (27)? oe wed ( : @ o 
_ exp [—(b — A)?/4] { ; 7 (2 + 4). 
= ames aaa ®, V/2 a 9 
If we now make a third assumption that 
a2 : + = ; (31) 
we can use Lemma 1 again (and 2~? < 1) to bound P, : 
2 
P, S exp [—(b — A)*/4] exp {-[a — (2+) } 
(32) 


exp [—(b — a)’/2] exp [—(a — A)’/2]. 
Inserting the bounds on P, and P, into (28), we obtain 
P(E,) S exp [—@ — a)’/2]{1 + M exp [—(@ — A)*/2]}, (88a) 
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where from (27), (29), and (81), 
A 


b+A 
2 


IIA 
Q 

IIA 
ao 


(33b) 


It remains to choose the parameter a. A good choice will probably 
result when the upper bound of (28) is differentiated with respect to a 
and the result set equal to zero: 


g(a — b) — Mg(a — b) exp [—(a — A)’/2]= 0 


or 
M exp [—(a — A)’/2] = 1, (34a) 

or since M = exp (RT) and A = 6(2T)?, 
a = (R' + 8)(27)'. (34b) 


Let us now verify that when 0 < R < C, constraints (33b) are satisfied 
for this choice of a. Since R > 0, a = A. Further, since b = (2CT)}, 


- (244) = ace - am | CO] 
since 6 satisfies (17). Finally, from (17), 
b—a=[C’— (R} 4 )](27) = 0. 
Thus constraints (33b) are, in fact, satisfied. Thus from (84) and (88a) 
P(E,) = 2 exp [—(C? — R — 8), 


IV 


0, 


which is Theorem 1. 


Proof of Theorem 2: Let R (0 S R < C), 6 > C® — (4B), and 
6, , 9. satisfying equation (21) be given. Then 


Af 
P.=PrUN\\tyi<y,- A} S YP) fy; <4, — A}, 
7=2 t¥7 tA 
or 


P,SMPrf\ {ys <y, — A}, j= 2. 
tq 
The last inequality follows from the symmetry of the distributions of 
the y; G 2 2). Recalling that the density for y; (j 2 2) is g(a), and that 
the {y;}/" are independent, 
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Yi = a} 


M J gl) da Pr (Y tys <a — Ay. 


tAi 


forall 74] 
yi sya 





=a |” G0) da Pr} 


l 


Again using the independence of the y, and the fact that the density of 
y, is g(a — b) we have 


a OS ce gla — b) da || f° g(a) da | 


Ba — A — b)[®(a — A)”. 


Substituting, we obtain 
P.<M / il A= Bybee cA de. (35) 


Also note that 
[B(@ — A)]"~ 


[1 — &(@ — A)" 
exp [—-(M — 2)&,(a — A)]. 


(36) 


IIA 


As in the proof of Theorem 1, we break the integral in (85) into two 
parts a S a and a 2 a, where a will be specified later. In the range 
a < awe overbound @(a — A — b) by unity, and [@(a — A)]”~” by 
(36). In the range a = a, we overbound [@(a@ — A)]”~* by unity. Thus 


P.<M [ "We eat = Daa ae 


+M i "Sep SGr = Aoi da PO Ps.. 16H 


We now overbound P, and P,. First, 


P, = ‘3 ; epithe -aa 
< exp (—(M — 2)@,(a— ay] | g(@) de (38) 
= exp [=O = 2)e4G = Ap). 


Second, if we assume that 


asb+A (39) 


84 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1969 


we can write 


Ps ‘- FS ee Gare Ne 


= | ce eres [. Fe me ee 


In the first integral, a — A — b S 0, so that we may use Lemma 1 to 
bound ®(@ — A — b). In the second integral, we overbound (a — A — b) 
by unity. Thus 


Ps f[ g@ep(-@-A-H/2Idat+ | ge) da 


< | o(@) exp[-(@ — A — '/2] da + (A + 8). 
Since from (20) and the fact that R < C, 
A+b=(6+ C27) > 2(c? — R27)? > 0, 


we can again use Lemma 1 to overbound ®,(A + 6). Using the definition 
of g(a), we have 


P, < I Gal Sse Der ns A= a 
+ exp [—(A + 6)’/2] 
= exp [—(b + A)*/4] on if exp | -(« — b+) | da 
+ exp [—(A + b)’/2] 
—-1/2 i) 
exp lab Ay 7/4) on Vila—(b+A)/2) a 
+ exp [—(A + )’/2] 
S exp [-—(6 + ayyajo| v3(0 — ot A) + exp [—(A + 6)*/2]. 
If we further assume that 
a= (b+ A)/2, (40) 


then we can again employ Lemma 1 to bound &,[+/2(a — (b + A)/2)]. 
Hence 
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P, S exp [—(b + A)’/4] exp {-[a - Gall 


= exp {—4[a° + (a — A — b)"}} + exp [—(A + B)*/2]. 


The difference between the second and first exponents in (41) is 


—3{(A + 6)’ — [a + G@ — A — b)’]} = ala — (A+ B)] S0, 


by (89) and (40). Thus, the first term of (41) is not less than the 
second, and 


P, S 2 exp {—3[a” + (a — A — b)*]}. (42) 
Inserting the bounds on P,; (38) and P, (42) into (37), we obtain 
P, S M exp [—( — 2)®.(a — A)] 
+ 2M exp {—}3[a° + (a — A — b)’}}, (43a) 
where from equations (39) and (40), 
oto casbta. (43b) 
It remains to choose the parameter a, and here we will simply state 
a good choice of a without giving a motivating argument. Let 
a = (Ri + 5 — 6,)2T) (44) 


(where 62 is the arbitrary parameter which was selected at the begin- 
ning of the proof). We must verify that constraints (43b) are satisfied 
for this choice of a. First, since R < C and 6 > 0, 


b+A—a= (Ci — R + 6,)(2T) > 0. 
Thus a S b + A. Second, from equation (21b), 


a— (LEA) = 4 — Ic! — amy — 20,107) 2 0, 


so that a = (b + A)/2 and (48b) is satisfied. 


Now consider the second term in (48a). Direct substitution of (44) 
shows that this term is 


2M exp {—[(R? + 6 — 0.) + (Ct — R? + 6,)*]T}, 


a single exponential decay in T (as T — o). Finally consider the 
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exponent of the first term of (48). Substituting (44), it is 
—(M — 2)@,(a — A) = —(exp (RT) — 2)8,{[R? — 0,](2T)'}. 


Making use of the asymptotic formula ®,(u) = (2ru)7e™’’” as 
u— © (see p. 106 of Ref. 2), and letting T — © (and noting that from 
equation (21b), R? — @, > 0), this exponent is asymptotic to 


OnE = ayere oP NT) 


where K > 0. Thus the first term of equation (43a) decays to zero as a 
double exponential in 7, very much more rapidly than the second term 
of equation (48a). We can find a T sufficiently large so that the ratio of 
the first to second terms of equation (48a) S$ 6, . With T so chosen 


P, < (1 + 6,)2 exp {—[(R? + 6 — 6,)? + (C? — R' + 0,)? — RJT} 


which is Theorem 2. 


REFERENCES 


1. Hane, R. M., Transmission of Information, Cambridge, Massachusetts: MIT 
ress, 1961. 

2. Feller, W., An Introduction to Probability Theory and Its Applications, 
vol. I, New York: Wiley, 1950. 

3. Forney, G. D., “Exponential Error Bounds for Erasure, List, and Decision 
Feedback Schemes,” JEEE Trans. on Information Theory, /T-14, No. 2 
(March 1968), pp. 206-220. 

4. Kramer, A. J., “Use of Orthogonal Signaling in Sequential Decision Feed- 
back,” Information and Control, 10, No. 5 (May 1967), pp. 509-521. 

5. Kramer, A. J., “Analysis of Communication Schemes Using an Intermittent 
Feedback Link,” Stanford University Center for Systems Research, Tech. 
Rept. No. 7050-11. 

6. Schalkwijk, J. P. M. and Kailath, T., “A Coding Scheme for Additive Chan- 
nels with Feedback, IEEE Trans. on Information Theory, /T-12, No. 2 
(April 1966), pp. 172-188. 

7. Shannon, C. E., Gallager, R. G., and Berlekamp, E. R., “Lower Bounds to 
Error Probability for Coding on Discrete Memoryless Channels,” Infor- 
mation and Control, 10, No. 1, 5 (January and May 1967), pp. 65-103 and 
522-552. 

8. Wozencraft, J. and Jacobs, I., Principles of Communication Engineering, 
New York: Wiley, 1965. 

9. Wyner, A. D., “On the Probability of Error for Communication in White 
Gaussian Noise,” JEEE Trans. on Information Theory, JT-13, No. 1 
(January 1967), pp. 86-90. 

10. Wyner, A. D., “On the Schalkwijk-Kailath Coding Scheme with a Peak 
Energy Constraint,” IEEE Trans. on Information Theory, [T-14, No. 1 
(January 1968), pp. 129-134. 


Second and Third Order Modulation Terms 
in the Distortion Produced when Noise 


Modulated FM Waves are Filtered 


By S. O. RICE 
(Manuscript received August 8, 1968) 


This paper is concerned with the distortion produced in a frequency 
modulation wave when it passes through a filter. The phase or frequency 
modulation representing the signal 1s assumed to be a band of gaussian noise. 
The main result is an expression for the power spectrum W,(f) of the output 
phase angle 6(t). This expression holds for any filter, contains all of the 
distortion terms due to second and third order modulation, and is suited 
to computer evaluation. It ts useful in many cases, but it has the shortcoming 
of not containing any modulation terms higher than the third order. 

A second result ts an approximation to W4(f), based on log 1 +2) 2 
(that is, a “first-order” approximation), which is encountered in the deriva- 
tion of the main result. Although it does not contain all of the second and 
third order modulation terms, it does contain higher order modulation terms 
which may give most of the distortion in some cases. The results given here are 
compared with those obtained earlier. 


I. PREFACE 


This work is a sequel to “Distortion and Crosstalk of Linearly 
Filtered Angle-Modulated Signals” by E. Bedrosian of the Rand 
Corporation and myself.t One of the principal results of that paper 
is an expression for the distortion produced when a frequency modula- 
tion wave, modulated by gaussian noise, passes through a filter as- 
sumed to be symmetrical about the carrier frequency. 

The assumption of symmetry simplified the analysis, but led to 
zero second order modulation; and consequently the results do not 
apply to many cases of practical interest. 

The main result of this paper is an expression for the distortion 
which contains all of the second and third order modulation terms pro- 
duced by a general filter. It includes the earlier result as a special case. 
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As before, the input phase angle is assumed to be a gaussian noise. 

Some time. ago Mr. Bedrosian and I worked out, independently, 
the second order modulation terms. His analysis is somewhat different 
from mine and throws a different light on the problem. Since each ap- 
proach is of interest in its own right and since we were unable to 
combine the two without losing useful results, we decided to publish 
our work separately. An early version of Bedrosian’s analysis is given 
ina RAND memorandum.’ 


II. INTRODUCTION 


When an angle-modulated wave (FM or PM) passes through a 
filter, the signal becomes distorted. For a multichannel system this 
distortion may produce crosstalk. In many practical cases the second 
and third order modulation terms give a good measure of the distor- 
tion. These terms have been studied by a number of investigators. In 
this paper we obtain some general expressions for them for the case in 
which the modulation is gaussian. 

Our main results includes all of the second and third order modu- 
lation products. In this respect, it is more general than some of the 
earlier expressions for the distortion (see Medhurst,? Magnusson,‘ and 
Liou®). However, it does not give higher order modulation terms, some 
of which appear in earlier “first order” approximations. A first order 
approximation (similar to the earlier ones) occurs in our derivation of 
the main result. It is stated, along with the main result, in Section III. 

As in Ref. 1, the complex form of the filter input is 


st) = exp [twot + to()] (1) 
where the carrier frequency 1s w) = 27f, and the signal is carried by the 


real input phase angle ¢(t). Let the filter have the transfer function G(f) 
and the impulse response g(é): 


a) = | GW exp Gt) af, 
Fe Q) 


Gi) = | old exp (iat) dt, w = 2nf, 


where the response g(t) may be complex and may be different from zero 
when t < 0. 

The filter is regarded as a bandpass filter for which G(f) is large only 
near +f, . Let normalized functions be defined by 


_ Hho +f) _ g{t) exp (—twot) 
T() = G(fo) ’ v(d) me G(fo) (3) 
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From the definitions and the Fourier relations (2) it follows that 


vi) = [1 exp Ged af, 1) = [1 exp (—twt at, 


(4) 
[ woa=1,  r—p= [vO exp (—tod at, 
where the asterisk denotes ‘‘conjugate complex.” 
The filter output corresponding to the input s(t) is 
so) = i g(u)s(t — u) du 
= | ow) exp bint — u) + ie(t — a] du 
i (5) 


= Ee [ ; v(u) exp [7e(t — u)] in| exp (wot) 


exp (—ao — 7B) {R(d) exp [76(t)]} exp (wot) 
exp (~a. — 78») {exp [2O(1)]} exp (doi). 


The definition of y(w) is used in going from the second to the third line. 
In going from the third to the fourth line, the attenuation and phase 
shift, ao , Bo at the carrier frequency have been introduced by writing 
G(fo) as exp (—a — 789). The complex phase angle O(t) is related to 
the envelope R(t) exp (—a,) and phase angle —8 + @(¢) of the output 
[a and 8 are constants which do not depend on ¢(£)] by 


exp [7O(1)] = R(d) exp [2@(d], 2O(t) = In R(t) + 70d), 


(6) 
6(t) = Re O(2), In R(t) = —Im Of). 
Comparing the third and fifth lines of equation (5) leads to 
O(t) = —iln | i y(u) exp [ie(t — u)] in|. (7) 


The analysis may be simplified by introducing the “linear portion” 
&(t) of O(f). Working with the case in which the input ¢(#) is small gives 


OO = —i1n E +if ywett —u)dut+ oA 


wi (—oi | yedolt — w) du, 
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and this leads us to define the linear portion of O(£) as 
a) = [| yaelt — w du. (8) 


The complex phase angle O(t) may be separated into its linear and 
nonlinear portions by adding and subtracting 7#(t) in the exponent in 
equation (7): 


O(t) = &(t) —7tIn i y(u) exp [ie(t — u) — 76(2)] in| . (9) 


Adding and subtracting 1 in the integrand and using the fact that the 
integral of y(u) is unity gives the fundamental relation 


O(t) = (7) —7#Inf{1+ KO] (10) 
where 


K() = [du yea) texp [ie(t — a) — #(Y) -— 1). AD) 


In most cases of practical interest, K(é) tends to be small. When 
| K(t) | < 1, expansion of the logarithm gives the series 


Q(t) = of) — iK(t) + 2K _ 2K aoe (12) 


upon which our analysis is based. When ¢(é) is gaussian, | K(é) | will 
occasionally exceed unity. It appears that results obtained from the 
series of equation (12) represent the first few terms of an asymptotic 
series. This is further discussed in Appendix F. 

If ¢(£) is small for all values of t, expansion of the exponential in the 
definition of K(t) [equation (11)] shows that K(t) is 0(¢’). Our main 
expression for the distortion, given in Section III, neglects terms of 
order ¢°. For this accuracy, equation (12) can be written as 


O() = H() — KW) + 2K + Of. (13) 


Since the variable portion 6(t) of the output phase angle is the real 
part of O(4), the de portion, 6,, , of @(é) is the average value of Re O(2). 
When the input ¢(t) is a stationary gaussian process with zero mean, 


Bac = Re (4) 


RS (—aK(o fe 5x) + Oly) (14) 


Im (K(t) — 2" K*(t))wv + O(@’) 
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where ( ),y denotes “ensemble average” and (®(t)),, is zero because 
#(¢) depends linearly on g(t). Notice that from equation (5), the total 
output phase angle is 6() = —8, + 6(é) and that the de part of 6(£) is 


Bae bos —Bo + B26 . 


The two-sided power spectra W5(f), Wo(f) of 6(é), 0(¢) contain the de 
spikes (—Bo + 6..)° 6(f), 6%, 6(f), respectively. Furthermore, 


Wilf) = (Bo — 2806.) S(f) + Wolf). 


Here 6(f) is the unit impulse function. 
In the following work it is convenient to ignore 8), and we shall 
call 6(¢) itself the output phase angle. 


III. STATEMENT OF PRINCIPAL RESULTS 


In all of the results stated here, the input phase angle ¢(¢) is gaus- 
sian with zero mean. The two-sided power spectrum of y(t) is Wo(f). 
The two-sided power spectrum of the output phase angle 6(t) is Wo(f). 


3.1 Second and Third Order Modulation Terms in W,(f) 
The principal result given in this papery is an expression for Ws (f) 
which contains all of the second and third order modulation terms: 


Wolf) = 62. 5(f) + 4W.(f) | UG) + U*(—-/ I 


5 a dp W.(e)W.(f — ) | Tle, f — 0) — T*—p, —f + ») 


+- 5 [. dp ia dao W (pW (o)W.f — p — @) 


-| S(p, Ce Mc p ~— g) vie t= py —o, oa lc p +a) |’ 
+ O¢°W,). (15) 





+ Note added at press time: Equation (15) gives essentially the first few terms 
of a general expansion due to A. Mircea, Rev. Roum. Sci. Tech—Electrotechn 
et Energ., 1967, ¢. 12, No. 3, pp. 359-871, and Proc. IEEE (Correspondence), Octo- 
ber 1966, 54, pp. 1463-1466. I regret the oversight of Mircea’s excellent work. Use 
of his results would have substantially improved this article. 
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Here the de part, 64., of 0(¢) is the imaginary part of 
1 eo 
D, = -3[ dp W ,(p)S(p, —p) 


; ie ae [ : do W,(p)W ,(a)S(—o, —p)[$S(c, p) — To + 9)] 


+ O(¢’) (16) 


and 
T(p, f — p) = Slo, f — p) 
= i: do W ,(c)[2S(c, p)S(—o, f — p) — Sle, f — a) 
— T(o)I(—o)S(p, f — p) + Sip + 0, f — p — o)] (17) 


UH =1H) + | do WLT)S(—o, 1 


+ [do [de W)C) Ilo + )S(—p— 7, FTO) 


-[3S(—c, p)S(—p, f) _ S(p, f — p— a) + S(p —_ od, j = p)]} . 

(18) 

The I'(f) is the normalized filter transfer function defined by equa- 
tion (3) and the functions S are discussed in Appendix B. They depend 


only on the filter. That is, they are independent of W,(f), and are 
defined symbolically by 


Si, --+ 2) = Tb - re] (19) 


where the power y* of y is to be replaced by I'(z) after multiplying 
out the product. The S’s are symmetric functions of their arguments. 
For n = 2 and 8, 


S(p, 0) = y?" — yTo) — y'T(e) + T)T) 
= I(o + o) — To)T(o), 
S(p,o,v) = Tip to +7) — I(p + o)TQ) 


— (p+ To) — Te + »)T(p) + 2T(o) TOTO). 
The S(p, o, v) of Ref. 1 is the negative of the one used here. 


(20) 
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In the “order of” symbol appearing in equation (15) the W, enters 
for dimensional reasons. This is in line with 


[ 4.0 = Om (21) 


as can be seen by integrating both sides of equation (15) from f = 
—o tof = o. 

In some instances the expression for Ws(f) is useful when rms ¢(¢) 
is large but rms d¢(t) /dt is small. 

Bedrosian has computed curves showing the second order distor- 
tion for the case of quadratic phase shift, that is, for I'(f) of the 
form exp [tAf?].? 

As equation (15) stands, it is oriented towards phase modulation. 
For frequency modulation the time derivatives ¢’(£) = de(t)/dt, 6’(t) = 
d6(t) /dé replace the phase angles ¢(é), @(é) as the items of interest. Since 
the power spectrum W,.(f) of 6’() is equal to (2rf)’W.(f), multiplying 
(15) by (2rf)’ converts it into an expression for W,,(f). On the right side 
of (15), the factor W,(f) in the first line is replaced by W,.(f) and the 
W,’s appearing in the integrands may be transformed into W,.’s with- 
out introducing infinities. The last statement is seen to be true for the 
second order modulation integral when we write 


W (p)W o(f — p) = We (e)W,(f — p) (20) *p *(f — p)? 


and observe that the product p'(f — p)~'T(p, f — p) remains finite 
even when p and (f — p) approach zero. The third order term may be 
treated in a similar way. 

In many applications W,.(f) is proportional to D® where 


oe), 


and D is the rms frequency deviation in cycles per second. Then the 
second and third order modulation integrals in (15) are proportional to 
D* and D*, respectively, as D tends to zero. This suggests that the 
remainder term, 0(¢°W ,), is proportional to D®. For this reason we shall 
sometimes refer to (15) as the “‘small deviation” approximation. When 
the FM signal to crosstalk ratio in dB is plotted as a function of log D, 
the behavior of the resulting curve as D — 0 can be computed from (15). 
Indeed, if the second order modulation predominates, (15) furnishes an 
asymptote to the curve with a slope of 6 dB per octave. If, because of 
symmetry in the filter, the second order modulation term in (15) is 
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zero, the third order term gives an asymptote with a slope of 12 dB 
per octave. 

Some idea of how equation (15) begins to fail as D increases from 0 
may be obtained by considering the case g(t) = A sin w,t. Then the rms 
frequency deviation is D = f,A/2? and the filter output is 


sol) = Yo SuCAVGLHe + mf.) exp [ian + ref 


where J,(A) is a Bessel function and w, = 2afa . Consider only the 
second harmonic. It is proportional to J,(A), and the approximation 
underlying (15) is roughly equivalent to replacing J2.(A) by A’/8, the 
leading term in its power series. The value of A which makes A’/8 
exceed J2(A) by 3 dB is A & 2.0 and the corresponding D is 1.4 f, . If 
the baseband of a gaussian FM wave were flat and extended from 0 to B, 
the expected number of zeros per second would be 1.16 B. This is the 
same as the number of zeros of A sin w,¢ with f, = 0.58 B. This represen- 
tative value of f, leads to the estimate that (15) will be in error by 3 dB 
when D &% (1.4) (0.58)B ~ 0.8 B. Comparison of (15) with experimental 
values indicates that the 3 dB error point typically occurs when D lies 
between B/2 and B. 


3.2 Power Spectrum of a6(t) + bln R(t). 

Equation (15) for Wse(f) may be modified to give information re- 
garding the fluctuation of the envelope R(t). This information may 
be of interest, say, in determining the distortion produced by “AM to 
PM conversion.’ More generally, suppose that one is interested in the 
power spectrum W,,(f) of 


x(t) = ad) + bIn RM = Re [(a + 120)0@] (22) 


where a and b are arbitrary real constants. Then W,(f) is given by 
an expression obtained from equation (15) upon replacing U(f), 
T(p,f—p) and S(p, o, f—p—o) by (a+b) U(f), (a+%b) T (p, f—p), and 
(a+2b)S(p, o, f—p—o), respectively, so that U* (—f) is replaced by 
(a—1b) U* (—f), and so on. (See Appendix E.) 


3.3 Second and Third Order Modulation Terms for ‘Small and Slow” 
Frequency Deviations 


The expression (15) for W,(f) simplifies when 


(2) If) can be expanded as a power series 


DISTORTION IN FM WAVES 95 


r(f) = ED Tae Soar (23) 


n=1 n! 





and 


(iz) the effective spread of W,(f) is so small that the IT’s used in 
equations (15) to (20) can be replaced by the first few terms of their 
power series expansion. Roughly, this means that the top baseband 
frequency is small compared with the filter bandwidth. The instantaneous 
frequency changes slowly in comparison with the envelope of the impulse 
response of the filter, and the quasistatic case is approached. With these 
assumptions the resulting simplified form of W,(f) is given by equation 
(126). A more complete form of the small and slow deviation approxi- 
mation is given in equation (133) which brings out the asymptotic 
nature of the results. 

The sum of the second and third order modulation terms given by 
the integrals in equation (126) [which are the simplified versions of 
the corresponding integrals in equation (15) | is 


£) = 20a + 2D? [de WA — eV — 0° 


+670)? | de | do W.(e)W (Wolf — p — o)p*o%(f — p — 0)’. 

ee (24) 

Here W3(f) is the portion of W,(f) which gives the interchannel inter- 

ference, that is, the noise a listener would hear in an idle channel in a 

multichannel frequency division multiplex angle-modulation system. 
D is the rms frequency deviation in cps, 


v= (ef), : 


where ( ). denotes ensemble average. The quantities \,; are the im- 
aginary parts of the semi-invariants \, defined by the expansion 


nhrf= > uh (26) 

n=1 Tbe 

Equations (125) express the first five \,’s in terms of the first five a,’s, 
the coefficients in the expansion of I(f). 

The corresponding approximation for the power spectrum W7(f) 

of x = aé(t) + b In R(é) [see equation (22)] is obtained by replacing ),,; 

in equation (24) by (ad,; + bA,,-) where \,, denotes the real part of A, . 
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In the important FM case in which the baseband signal ¢’(t) has 
a flat power spectrum, the two-sided power spectrum of ¢’(t) may be 
taken to be 


0, lf|>B 
(2rD)’/(2B), |f|<B 


where the baseband extends from f = 0 tof = B (that is, from —B 
to +8 for two-sided spectra), and D is the rms frequency deviation. 
Then 


Wef) = | (27) 


0, If|>B 


Wo) = Wef/Onf? = | (28) 
D/Q2Bf), |f| SB. 


When this expression is used, the integrals in equation (24) may be 
evaluated and it is found that, for 0 $ |f| S B, 


I ss D* coat -1 772 2 D 2 42 2 
s() = ga OB — | f )Qee + 2°D*N)® + gees BB — AO, 





OS|f|SB. (9) 


The average signal power in an elementary frequency band extending 
from f to f + Af in the input base band is W,,.(f) Af (radians per second)’. 
The ratio of the interference power to the output power in the same 
elementary band is 


Wilf) Af _ WHA). 
Wo(f) Af — Wilf) 


For the flat baseband FM case we may approximate W,(f) by W,(f) = 
D’/(2Bf’) and use equation (29). This leads to the approximation 


peo Z rp (2 = liNa,, 427 D%,,)? + B (3 = Boao | ; 





(30) 





(31) 


for the ratio of the interchannel interference power to the signal power 
in the elementary band (f, f + Af). This ratio has meaning only if 
lfl s B. 

Liou has given an approximation which is equivalent to equation 
(24) for W3(f) with several more terms included.” This approximation 
is discussed in Section XII. 

The small and slow deviation approximation described above gives 
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results which agree well with Monte Carlo computations made by C. L. 
Ruthroff.° For illustration we take the simplest of Ruthroff’s cases, 
the one in which the transfer admittance of the filter is 


Gf) = a x= Uf — fr)/fe . (32) 


Here f, is the carrier frequency and f, is the filter semibandwidth. 
Putting f = f, gives G(f.) = 1. Putting f = f’ +f. gives x = 7f'/f, 
and 


in rg) = md d 


—In (1 + 2x + 22° + 2’) 


(33) 
xv xe 2 


=-»4 


n=1 


The last line follows from the series of equation (26) defining the ),’s 

as the coefficients in the expansion of In I'(f’). In going from line 2 to 

line 8, the logarithm is expanded by setting a, = 2, a, = 4, a; = 6, 
= Oforn > 3in 


In (1 -{- 9 a,a/n!) = 3 Aw’ /n! 


and by using the expressions (125) for the ,’s. 
Substituting x = 2f’/f, in line 3 of equation (33) and comparing the 
result with the last line gives 


Ay = —2(tf-"), do = 0, As = 2(zf.")°, Ne = 0. 


Hence Az; = 0, A3; = —2f7°, \4; = 0, and the approximation of equation 
(81) for the ratio of the interchannel interference power to the signal 
power leads to 


: Wi Af [2 (a - £)|. 
10 logy, W(f) Af ~ 10 logio oF B (34) 
In his Fig. 15 Ruthroff has plotted values of 


—10 logic [Wa(f) Af/Wo(f) Af] 
for several different values of D/B and f/B with B = 7 MHz and 
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f. = 119 MHz.° The agreement with our equation (34) is good at 
f/B = 1.0. Our D/B is the same as Ruthroff’s ¢/W and our f/B = 1.0 
corresponds to Ruthroff’s slot 10. At f/B = 0.4, equation (84) gives 
values which are about 3 or 4 decibels less than the Monte Carlo values, 
but this may still be regarded as good agreement. 

Similar agreement is found when the small and slow deviation approxi- 
mation is applied to a number of the other cases examined by Ruthroff. 


3.4 <A “First Order’ Approximation 


Although equation (15) is useful in some FM distortion problems, 
in some cases it is of no help. One example concerns the distortion 
produced by an ideal filter centered on the carrier frequency and having 
a semibandwidth exceeding 3B, where B is the baseband. That is, 
W(f) is O for | f | > B. In this case the distortion is produced by modula- 
tion terms of order higher than the third, and these are neglected in 
equation (15). 

For such problems “first order” approximations can sometimes be 
used. The term “‘first order’ refers to the approximation In (1 + 2) Sz 
where x is of the nature of K(é) in equation (10); it does not refer to the 
order of the modulation products in x. Different choices of x lead to 
different first order approximations. The first order approximation 
given by the first two terms in the series of equation (12) for O(é) is 


6(t) © Re @(¢) + Im Kd). (35) 
The output phase angle @(¢) may be written as the sum 
Ot) = Bae + O:(t) + One() 
where 0,(t) = Re &(£) is the “‘linear portion” of 6(¢), and 6,,(t) given by 
One(t) = A(t) — A(t) — Bae 
a(t) — Re &%) — 4. 


I 


(36) 


is the time varying part of the ‘nonlinear distortion” in @(¢). The first 
order approximation for 6,,(¢) corresponding to the first order approxi- 
mation of equation (35) for @(£) is 


0.) Im K(t) — be & Im KK) — Im (KW) = 9) 7) 
where 
y() = Im [KW — K@®)wl. (38) 


The work of Section VI, which is part of the derivation of equation 
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(15), shows that the power spectrum of y(é) is 
WG) = 2° Re [PG) + QG) + P*(—f) + Q*(-/)] (39) 


where 


PQ) = = [dr exp (—twr) [au [dv ver) 
-exp [alu) + a()}exp [2e(u, », 7] — 1) 
a) = § [ar exp (ior) [au [do yenre) 
-exp [a*(u) + a(v)]{exp [2é(u, v, 7)] — 1} 
aw) = —$ [af W.HOH-D a 
ou, 0,2) = 5 [df WQHA-DHA) exp (iar) 


atu) = 5 [df WHHL exp Ger) 


H,(f) = exp (—wu) —TQG), @ = 2rf. 


The function Q(f) is real when f is real, and P(f) is an even function of f. 

The first order approximation W,(f) for the power spectrum of @,,¢(t) 
contains some higher order modulation terms which are not contained 
in our main equation (15) for W,(f); conversely, equation (15) contains 
terms which are not in W,(f). In using the first order approximation of 
equation (35), which may be rewritten as 


6(t) = Re S(t) + Im K() + O(K”), 


one should guard against throwing away* [in the O(K”) terms] quantities 
which are of the same order as those being computed from Re #(t) + 
Im K(é) (the leading term). Although each case requires its own in- 
vestigation, it is helpful to remember that K(é) is O(v’). Furthermore, 
when y(t) is real, Im K(é) is O(’). Also when I'(f) = 1, K(t) becomes 
0; and when 


lrg) -1l]<e<«l 


for all real values of f (as in the case of small wave-guide echoes), it 


*This type of error has been discussed by Enloe, Ruthroff,: Gladwin, and 
Medhurst in Refs. 7 and 8. 
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may be conjectured that K(£) itself is O(e) irrespective of how large 
(y" aw may be. 

The approximation @,.(¢t) & y(t), i is not quite the same as earlier first 
order approximations.*’*"°- It is a closer, but more complicated, 
approximation because y(t — u) — &(£) is used in the integral equation 
(11) for K() instead of y(t — u) — y(t) as in the earlier approximations. 
Appendix D gives some results obtained when the present analysis is 
repeated with y(t — u) — g(t) in place of g(t — u) — &(Z). 

Equation (89) for W,(f) gives a first order approximation for the 
power spectrum of @,,(¢) = @(f) — Re ®(¢) — 6,,. The corresponding 
approximation for the power spectrum of 


In R(t) + Im) — [In R®™|e. Y Re [KO — (KO))x] = x(t) 
is 
W.(f) = 2°" Re [—P@) + Qf) — P*(—-f) + Q*(-f)]. 
3.5 Simplification When Filter has Symmetry T(—f) = T*(f) 
When the filter has the symmetry 


Ria beg) (41) 
about the carrier frequency, the even order modulation terms disappear, 
S*(—2,,°++ , —%,) becomes equal to S(x,, °°: , 2), and equation (15) 


becomes 


Wa) = Wo) | UA) FP 
+5] aw] do W.W.OW.G - »- 0) 


| S(p, Go; , — p— o) |’ a O”'W,). (42) 


Here U(f) is still given by equation (18) and S(p, c, v) by equation (20). 
This expression for W,(f) agrees with one of the main results of Ref. 1 
when the double integral in equation (18) for U(f) is assumed to be so 
small that it may be neglected. 

When I'(—f) is equal to I'*(f), the coefficients a, in the power series 
of equation (23) for I'(f) are real when 7 is even, and imaginary when 
n is odd. The same is true for the d,’s of equation (26). Hence do; , Aa; 
are zero and the second order modulation terms disappear from the 
small and slow deviation approximations equations (24), (29), and 
(31) for W3(f). 

.. The relation '(—f) = I'*(f) implies that y(u) is real and that H,,(—f) 
is equal to H*(f). Then a(u), c(u, v, 7), and é(u, », r) are real and é(u, v, 7) 
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is equal to —c(u, v, rT). Both P(f) and Q(f) become even and real. Equa- 
tion (39) for the power spectrum of y(é), that is, the first order approxi- 
mation for the power spectrum of the nonlinear distortion 6,-(t), be- 
comes 


W,) = Pf) + QQ). (43) 


Here the triple integral for P(f) is the same as that given in equation 
(40); and the triple integral for Q(f) may be obtained from the integral 
for P(f) by changing the sign of 2c(u, v, 7). Hence equation (43) be- 
comes 


Wess i, : eases i: dis [ : aes 


-exp [a(u) + a(v)] sinh [2c(u, v, 7)] (44) 
where w = 2rf, and a(u), c(u, v, 7) are given by equation (40). 
IV. INITIAL EXPRESSION FOR THE POWER SPECTRUM OF @(t) 


When 6(£) is a stationary noise process its two-sided power spectrum 
W,,(f) is the Fourier transform of its autocovariance: 


Wf) = | ; exp (—tor)(O()0(L-+ 1dr, = nf. (45) 


Denoting functions with arguments t, ¢ + + by subscripts 1, 2 and using 
a(t) = Re Of) = 2°"[O® + OF] (46) 
gives 
(O()OCE + 2))av = (0,82) ie 
= 27 Re [(0,02).v + (O%@z)avl- 


The procedure of Appendix A and equation (13) for the complex phase 
angle O(t) lead to 


(0,02) av = A(r) F AtC7) + Oy") 


(48) 
(0%0.)av = B(r) + B*(—7) + O(’) 
where A(r) and B(r) are the ensemble averages 
A(r) = (@,(27-"*®, — iK, + 12°°K3) — 2° K,(K, — K2)).v (49) 


B(r) = (@¥(27'@, — 1K, + 127°K3) + 2°°K*(K, — K2))w - 
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The remainder terms in equation (48) are O(’) instead of O(v’) be- 
cause the ensemble average of an odd order term is zero. 

It also follows from Appendix A that equation (45) for the power 
spectrum of 6(é) goes into 


Wi(f) = 27! Re [P() + Q() + PX —A + Q(—N] + O@'W,) (50) 
Pi) = [exp (—iwn)A(e) dr 
pa (51) 

Qf) = [ exp (—twr)B(r) dr. 


Although the functions P(f), Q(f) used here are not the same as those 
in Section III, they are of the same nature. 


V. CALCULATION OF AVERAGES NEEDED FOR COVARIANCES 


The equations of Section IV show that the value of (@(é)@(¢ + 7))ay 
depends upon various ensemble averages of products of ®(é) and K(t). 
When the input phase angle ¢(¢) is gaussian, these averages may be 
computed by using a result proved in Ref. 1. 

Let L be a linear operator which operates on functions of ¢, and let 


L exp (twt) = exp (dwt) f(f), w = Inf. (52) 
Let g(t) be a stationary gaussian process with two-sided power spectrum 


W,(f). Then 


(exp (iEo(0)u = exw |-2[" a wenana-n|- 69 


Setting zLy(é) for Le(é) and comparing coefficients of x” in the power 
series expansions of the two sides of equation (53) shows that 


[at WMD = (Lele - 


That (exp [2Le¢(t)]). is equal to exp {—27*([Ly(t)]’).,} follows from 
the fact that the real and imaginary parts of Ly(f) are correlated gaussian 
processes. 

In dealing with K(é) it is convenient to introduce the function J(v, 7) 
defined by 


J(v, 7) = exp [ig(é + 7 — v) — 4+ 7)]. (54) 
The dependence of J(v, 7) on ¢ is ignored because the right side of 
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equation (54) is a stationary random process, and J(v, 7) will be used 
only to calculate ensemble averages. The examples, which follow from 
the definition of equation (11) of K(), 


(Ky)oe = (KO) = dural, 0) — Ua 


(Ki Ko) = (K()K(t + 7))av (55) 


— [ du ie dv y(u)y(v)([ J(u, 0) — 1J[J@, 7) — 1))ev 


(KK) = [du [adv rOMLI*, 0) — ULC, ) — Ue 


show that averages of the type (J(u, 0)).., (J(u, o)J(v, 7))av, and 
(J*(u, 0)J (uv, 7))ay are needed. 

To calculate (J(u, 0))ay from the general result, equation (53), let 
L be the operator which carries ¢(é) into gf — u) — &(t). Replacing 
&(¢) by the integral which defines it gives 


Lol) = ot —w - [as yielt - 9. 


The function ¢(f) associated with DL is obtained by setting exp (dwt) 
in place of ¢y(é): 


exp (twif(f) = Llexp (tw#)] 


exp [ta(t — u)] — i ds y(s) exp [tw(t — s)], 


{(f) = exp (—twu) — Tf) = H.(f). 
Then equation (53) gives 
(exp [2L¢(t)]). = (exp [g(t — u) — 18(4)])av 


I 


= (J(u, 0))av = exp [a(u)] (56) 
where 


aw) = 5 [af WOHOHL-D. (57) 


The functions a(u) and 


H,(f) = exp (—72rfu) — Tf) (58) 
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play important parts in the analysis. The present H,(f) is the negative 
of the one used in Reference 1, a change made to simplify the analysis. 
The calculation of (J(u, 0)J(v, 7))ay proceeds in much the same way. 
Let 
Le) = g(t — u) — &) + ot + 7-2) - 04 27), 


ff) = H.(f) + exp (iw7)H,(f), 


] 0 
—5]f FW MENa—-f 
2 I. 69) 


I 


if ; df W (NUL)H.A—) + HANH 


+ 2 exp (tw7)H.(—f)H.(f)] 
a(u) + av) + 2c(u, v, 7) 


where W,(f) is an even function of f and c(u, v, 7) is the integral 


I 


ctu, v, 2) = —5 [a WDH—-DHAD) exp (2nfx). (60) 


Consequently, 
(JU, 0) JQ, 7))av = exp [a(u) + av) + 2cu, v, 7)). (61) 
Similarly, to calculate (J*(u, 0) J (v, 7))ay let 


Le) = -ot — + HY +ot+r-)-— E+, — Ey 


£(f) —exp (—zwu) + i : ds y*(s) exp (—iws) ++ exp (iw7)H,(f) 


—H*(—f) + exp (twr)H,(f) 


where the Fourier transform of y*(s) is '*(—f). The work of equations 
(59) and (60) goes through much as before with —H*(—f) in place of 
H,,(f). The result is 


(J*(u, 0)J(v, 7))a = exp [a*(u) + a) + 2élu, », 7)] (63) 
where 


atu, 0, 2) = 5 [df WDHDH) exp @afr). (64) 


All of the averages needed are given in Tables I and II. Items 1, 3, 
and 6 in Table I have just been computed, and the others may be ob- 





NO PwhNore 


No. 


1 
2 
3 


TABLE J—ENsEMBLE AVERAGES OF PRopuctTs oF J(v, 7)’s 


Average ly) Value 

(J(u, 0) dav H.Af) exp[a(u)] 

(J(v, 7) dav xp(twr)H.(f) exp[a(v)] 
(J(u, 0)J(v, 7) dav Hf) ee xp(twr)H.(f) xpla(u) + a) + 2c(u, v, 7)] 
(J(u, o)J (2, r)J(w, 7) av H.Af) + exp(twr)[H.(f) + H(f)] aly i a(v) + a(w) + 2c(u, v, 7) 

a 2c(u, w, 7) + 2c(w, v, 0)] 

(J*(u, 0) dav H*(—f) xpla*(w)] 
(J*(u, 0)J(v, 7) av H.*(—f) z exp(twr)H.(f) exp[a*(w) kn ae) + 2é(u, v, 7)] 
(J*(u, o)J (0, 7) J(w, 7) av —H (- _f) + exp(twr)[H.(f) + A.(f)] exp[a*(u) + a(v) + a + 2é(u, v, 7) 


2é(u, UW, 7) + 2c(w, v, o)] 


TasBLE [J—ENSEMBLE AVERAGES OF PRoDUCTS CONTAINING g(t — w) 


Average ld) Value 
ae = es + 7 — 0))av ; ; we EWP) expliolr — v + u)] 
u)J(v, 7) dav x exp(—twu) -+ exp(twr)H.(f) af” dfWo(f) explia(r + u)]H mi ) aie (v)] 


(elt — ae r)J(w, 7))av | exp(—twt) + exp(ior) Hf) + Hulf)] | if dfW,(f) explia(r + uf) + Half) 


exp[a(v) + a(w) + 2c(w, », 0)] 


NI NOILHOLSIA 


SHAVA Wet 


GOT 
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tained in a similar manner. The entries in the last column of Table I 
may be verified by expressing the a’s and c’s as ensemble averages 
[see equation (67)] and using (exp [7Le(t)]).y = exp {—2° [Leo] )av}- 

Table II gives averages of products in which one factor is g(t — wu). 
The first average, (y(t — u)y(t +7 — v))ay, is the Fourier transform of 
W,(f). The second average, (y(t — u)J(v, 7))av , is the coefficient of tx 
in the expansion of exp ([zZ¢(t)]) where 


Le(t) = xe(t — u) + oi + 7 — v) — 0 + 7) 
ff) = x exp (—iwu) + exp (tw7)H,(f). 


The third average may be computed in a similar way. 
The following list brings together the panier a(v), c(u, v, 7), - 
which appear in Tables J and II: 


(65) 


aw) = -5 [af W.OHOHA-D 
ou, 0, 7) = ~5 [df WDHA—DHAD) exp (afr) 
— (66) 
aw) = 5] a WOH DHX-P 


u,v, 2) = 5 [af WHADHA) exp (afr) 
where H,(f) = exp (—72afu) — I(f), and replacing H,(f) by —H*(—f) 
in a(u), c(u, v, 7) gives a*(u), E(u, v, 7). Also 
a(u) = c(u, u, 0), c(w,v, 0) = cv, w, 0), 
ctu,v, —7) =c,u, 7), u,v, —r) = é*Q, u, 2), 
ctu, v, T) = —3([o(t — wu) — BM) JLo + 7 — v) — B+ 7)) av 
eu, v, 7) = alot — wu) — SMD Ile(E + 7 — 0) — OG + 7)])v - 


When I'(—f) is equal to I'*(f), both y(¢) and ®(¢) are real; and it 
follows that a(u), c(u, v, 7), E(u, v, 7) are also real. Furthermore, H,,(—/f) 
= H*(f) and é(u, v, rT) = —c(u, v, 7). 


(67) 


VI. THE POWER SPECTRUM OF K (t) 


The de portion of the complex random process K(t) defined by the 
integral in equation (11) is the complex constant 
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(K())ee = (Kade = [dur (Fee, 0) = Va re 
y 68 


= [du yeu) {exp fata) — 1). 


This follows from equation (55) for (K,),. and the expression for 
(J(u, 0))ay given in Table I. 

The power spectrum of K(t) is the Fourier transform of (K*Ko),, . 
The integral for (K*K,),, given by equation (55) and the ensemble 
averages of J*(u, 0), J(v, 7), and J*(u, 0) J(v, r) given in Table I lead to 


(KAK oe = (KKK) + [du [dv yar) exp la) + aQ)] 


-texp [2éu,»v, 7)]} — 1}. (69) 


Integrals of the type appearing in equations (68) and (69) may be 
expressed as infinite series involving the S functions [which depend 
only on I'(f)] described in Appendix B and the more complicated func- 
tions S, described in Appendix C. Only the first few terms need be 
considered when most of the distortion arises from second and third 
order modulation. 

The definition [equation (167)] of the complex constant S, and its 
series expansion [equation (171)] give 


(K(d))av = So —1 


--if : dp W.(p)S(p, —p) 


+5 [ae [do WAS, 6, ~p, 2) + ae 


Expanding exp [2é¢(u, v, r)] in equation (69) in powers of 2é(u, v, 7), 
replacing each é(u, v, 7) by its defining integral [equation (66)] with p in 
place of f, and integrating with respect to wu and v with the help of 


Silos <-+ 5 00) = favre) exp a@)] TT Ho) 7 
leads to 
(KR = [Kade PE DAL ao [dol TL w.000 | 


-exp [¢2rr(pi: + +++ + pa)JS3(p1 , °° * 5 Pn)Snlfr , °°* Pn)» (72) 
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When this expression for (K*K,),, is put in the integral 
Wel) = [exp (—twr(KAKeee dr, = 2ef (78) 


for the power spectrum, Wx(f), of K(é) and the Fourier transform of 
unity denoted by 6(f), the result is 


Wall) = |KD)ae Pa) + [adr exp (—ter 


f ; du [ dvy*(wr0) exp [a*(u) +a0)] {exp [26Cu, », 2)]—1) 


I 


| So — 1 |? 6f) + Dai fa vee [ «, 


6 — pr — eee od| TI W (0d | | Si(pr 5 *°* y Pn) P 
=|S —1/P 6) + WA | Sif 


+5] do WW. — 0) | Sle,f- 0 E 


+ ; a : dp [ : do W(p)W (o)We(f — p — 0) 


‘| 8:(p,0,f-—p—o)|?+---. (74) 
The leading terms in the series for S,, S,;, S2, S3 in terms of un- 
subscripted S’s are given by equations at the end of Appendix C. The 
inequality for S, given in Appendix C may be used to show that the last 
series in equation (74) converges when W,(f) remains finite for all 
values of f and ([y(é)]’).v is finite. The convergence of similar series which 
will be encountered later will be tacitly assumed. 


vit. “FIRST ORDER” APPROXIMATION FOR POWER SPECTRUM OF 6(t) 


Before taking up the problem of computing W,(f) from @(¢) = Re O(é) 
and 


Ol) = H() — iK() + ZK + Of, 


which is the same as equation (13), we shall go through a similar, but 
simpler, calculation using the “first order” approximation 


Ot) = 6%) —iKO+0¢).- — + - (5) 
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Neglecting O(K’) terms in equation (14) for 6,, , and in the covariance 
expressions given in Section IV, shows that 


O12 = Im (K(t))av + O') (76) 
Wi(f) = 2 Re [P(f) + Qf) + P*(—f) + Q*(—f)] + O@'W,) 
where P(f), Q(f) are the respective Fourier transforms of 
A(r) = (®,(2°*@, — iK,) — 2K, K») ay (77) 


B(r) = (b4(27"@, = 1K.) — 2 KAK.).< 


Expressions for (K*K.).. have been obtained in the preceding sec- 
tion. Repeating the work with K, in place of K* brings in y(u), a(u), 
c(u, v, tT), —H,(—p), and C= Sis Hs 5 —p,) in place of y*(u), 
a*(u), é(u, v, r), and H*(p), S*(p,, --+ , pn). The result is 


(K,Ks)oe = [Kaus]? + [ : Hi [ Wea exp aa ew 
-{exp [2c(u, v, 7)] — 1} 
= ea + EH ae [aol EE wc | 


-exp [72r7(p, + +++ + pr)] 

-Sn(— pi ne) — pn) Sa(P1 a) Pn) (78) 
The remaining portion of A(r) in equation (77) is 
(8,(27'8, i 1K 2))av 


Pe [ ; du | * dy yluyyo)(2* o(t — Welt + 7 —v) — ig(t — uJ, 7))av 
= [aw [ doveone) 


[df WD) exp bile + w)]{2* exp [— ten] + H.() exp (a0)]} 

- (79) 

where w = 2z7f and Table IT has been used in going from the first equa- 
tion to the second. Integration with respect to u brings in I'(—f), and 


integration with respect to v brings in both I'(f) and the function S,(f) 
of Appendix C. 


(BQ, — iK))ow = [a WD ex (io (DITO + S, D1 
80 
Replacing ®, , y(u) by S* , y*(u) causes I'*(f) to appear in place of 
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I'(—f) and shows that the remaining portion of B(r).in equation (77) 
is 
(BHD, — Kw = [af WoC) exp Go TET) + SUD]. 
(81) 
The function A(r) is the sum of —27*(K,K,), obtained from equation 
(78), and (80). Its Fourier transform is 


P(f) = WAT(—Nl27 TP) + Sah] — 27° Ka)aw)” (f) 
- 5 a "dr exp (—#2xf7) / ; ai / : Pee) 


‘exp [a(u) + a(v)]{exp [2cu,v, 7)] — 1} (82) 


where the leading term follows immediately from equation (80) and 
the Fourier integral theorem. 

The function B(r) is the sum of 27'(K#K,),, , obtained from equa- 
tion (69), and (81). Its Fourier transform is 


QA) = WLA)T*AI2 TY) + SiC] + 27” | Kade |? 8(f) 
+ : [- dr exp (—72rfr) i du is dv y*(u)yv) 


-exp [a*(u) + a@)] {exp [2éu,v, 7)] — 1}. — (82) 


A first order approximation for W,(f) may be obtained by combining 
equations (76), (82), and (83). Deleting the terms multiplied by 6(f) 
and W,(f) gives the first order approximation to the power spectrum of 
the nonlinear distortion 6,;(¢). This approximation is stated by equations 
(38), (89), and (40) in the section describing the results. We now proceed 
to express the first order approximation for W,(f) as the series given by 
equation (90). 

When equation (82) for P(f) is added to equation (83) for Q(f) and 
the triple integrals replaced by their series, namely, the Fourier trans- 
forms of the series appearing in equations (78) and (72), the result is 


PA +AAP) = WOE(-) + *Q)] 
“(2°TG) + SiC] + 27°C (Bada |? — ~ [Kao 5(f) 
+5 aa dp, ef Pn af — 6 Rie oS o| TL Wo) | 
S 


: n( Pr Meee Pol) Sal pole sg — pa) S51 »7° "4 pr) |. 
(84) 
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Changing the signs of f and the variables of integration p,, --- , pr; 
and then taking the conjugate complex shows that the series in the 
expression for P*(—f) + Q*(—f) differs from equation (84) only in 
that the S, factors are replaced by 


S*(-p 970% y = p= ( Sas Woe ag Pn) a S,(— pi eS Pg — pn). 
(85) 


Taking (—)""* out of the square brackets in equation (85) and then 
adding the series term in P(f) + Q(f) to the series term in P*(—f) + 
Q*(—f) gives 


aa | de Se [ «, bf — a — a a ol TL wad | 


Sapa tty py) SH) Sil pig ts Se) | 88) 
for the series term in P(f) + Q(f) + P*(—f) + Q*(—f). 


The term for n = 1 in the series of equation (86) is 
2W(f) | Si(f) + S¥(—f/) [’. (87) 


From the first line in equation (84), the sum of the other terms in 


Pf) + QG) + P*(-f) + Q*(-f) containing the factor W,(f) is 
W.OIT(-f) + *Q)] 
[20 2 Tp) ae Si Sta) (88) 
The real part of the sum of equations (87) and (88) may be written as 
aW 60) | TO Ef) PSO Sip (89) 
These results and equation (76) for W,(f) lead to 
Wilf) = 63. 6) +4 °Wo) | TH + Si) + T*(—-f) + SH-A |? 


+- 47? Dafoe. site [ if — pp — ee ~ oT Wo) | 
| S,(p1 ee Pn) ox (=)S3(— Pi 277 4 =p) |? “ Ow'W,). (90) 


The remainder in equation (90) for W(f) is O@*W,) while the one 
in the main result, that is, equation (15), is O@°W,). The result of 
neglecting all O@%*W,) terms in equation (90) agrees with the result 
obtained by neglecting the O(v*W,) terms in the main result. This may 
be verified with the help of 
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Si) = ~5 | de WLe)S(0, —p, ) + OC" 


S2(p, ) = S(p, ?) = Oy’) 
which follow from the equations at the end of Appendix C. 


(91) 


vill. “SECOND ORDER” APPROXIMATION FOR THE POWER SPECTRUM OF 6@(t) 


In Section VII the “first order” approximation to Wo(f) is com- 
puted using the approximation 


Ot) = &%) — 7K + O¢') (92) 


for the complex phase angle O(é). In this section the “‘second order” 
approximation to W,(f) will be computed using the approximation 
given by equation (13), 


O() = Hy — KK) + EK + Of). (93) 


The equations needed are given in Section IV. Portions of the ensemble 
averages A(r), B(r) defined by equation (49) have already been obtained 
in Sections VI and VII. The remaining portions needed are 


(2°, K3)av ; (2K Ka eos (94) 
for A(r) and 
(2BAK aay  (— 2 KAK2) aw (95) 


for B(r). 
From Table IT, 


I 


(276,K2),, = 127? er / ab i reo one 
-(o(t ars ulJv, r)J(u, T) — Jv, T) = J(w, 7) “ Le 


—27 i du [ dv [ dw y(u)y(v)y(w) 


[df WD) exp lias + WIBH.QLVWe — VI, 


w = Qrf. (96) 


In going from the first to the second equation, symmetry in v and w 
has been used to replace H,(f) + H.,(f) by 2H,(f) and we have introduced 
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part of the notation 
U = exp [a(u)], V = exp [a@)], W 


v= 8xp [2c(u, Vv; 7)\, y = exp [2c(u, W, als z 


I 


exp [a(w)] 
exp [2c(w, v, 0)] 


exp [a*(u)]. 
(97) 
As in equation (80), integration with respect to u brings in I'(—f), 


and integration with respect to v and w brings in the functions S,(f;), 
S,(f) of Appendix C: 


f= exp [2é(u, v, tI, gy = exp [2é(u, W, ie U? 


G2*O.K 9 = — [af exp Gon) OT(—-DISwlf) — SN] 08) 


The corresponding portion of B(r), (27'’&*K?),,, is equal to the 
expressions obtained when y(u) and I'(—f) are replaced by y*(u) 
and I*(f) in the right sides of equations (96) and (98). 

The last portion of A(z) is 


OK, K}),, = 2 f ; au [ : ao [ : dia @vta) 


(JG, 0) — I][J@, 7) — Yl, 7) — Dew 
= (C, + D,(7) (99) 
where C, is independent of 7 and represents the value of equation (99) 


at 7 = o., With the help of Table I and the notation defined in equation 
(97), the ensemble average in the integrand may be written as 


UVWayz — UVa —- UWy —- VWz2+U0+V+W-1. (100) 


The only variables in this expression which contain 7 are x and y. When 
T— ©, c(u, v, 7) tends to 0 and x and y tend to 1. Therefore the portion 
of equation (100) which contributes to C; is 


UOVWze — UV —- UW —- VWz+U+V+W-1 
and the portion contributing to D(z) is the remainder 

UVWa(ay ~— 1) — UV(«z@ — 1) — UWly — I). (101) 
The portion contributing to C, will be ignored since the Fourier trans- 
form of C, , namely C, 4(f), is part of 63, 6(f), and 6,, will be treated by 


itself. 
When zy — lis writtenas@ —-)Dy-Dte-VY+y-) 
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equation (101) becomes 
UVWa(e — 1)(y — 1) + UV(Wez — I(@ - 1) + UW (V2 — Dy - 21). 


The symmetry in v and w allows the last summand to be replaced by 
the second and hence 


Di) = [du [ao [aw yadroyrw) 
(27° UVWee — 1)(y — 1) + UV(We — IY — 1)). (102) 


Expanding (x — 1), (y — 1) in powers of c(u, v, 7), c(u, w, 7), respec- 
tively, and integrating termwise in much the same way as in the passage 
from equation (69) to equation (72) leads to 


Di) = PP an [a] Too | 


-exp [72rr(p, + +++ + p,)] 
‘Sn = Pr 9 8 3-— Pe) aa Pept? aha) = alii 5? S* 5 Pa) 


1 Dar ats co 
+5 eer f dp. - [i doy 


fe (ee ce don be wo || Td Wa) | 


“exp [¢2r7(p. Sep, A Oe RSS Gin) 








Cree ee ee ee ee 
SamlOr °° + 5 Pop OL, 077 y Fm) (103) 
When the last portion of B(r) is written as 
(—2°"K*K3),.y = C. + D,(r) (104) 


the work goes through much as for C, + D,(7). The functions y*(u), 
J*(u, 0), U*, €, and @ replace y(u), J(u, 0), U, x, and y, respectively. 
The functions H,,(—p,), and H,(—o,) in c(u, v, 7), c(u, w, 7) are replaced 
by —H*(p,), and —H*(oc,). This carries x, y into 4, 9 and causes S,(p, , 

» —Pn) to be replaced by (—)"S*(p, , -++ , pr). A similar replacement 
holds for Sysm - 

The resulting expression for D,(r) is obtained by changing the sign 
(because —K* replaces K,) of the expression (1038) for D,(r), and then 
replacing S,(—p., -*° —p,) and. Sat Pp je" go Oa) DY (—)"S4(r1 , 

- Pn) and (rr SE a (Pa > *'* ) Om), respectively. 


DISTORTION IN FM WAVES 115 


Now that expressions for the portions (94) and (95) of A(r) and B(r) 
have been obtained (in effect), there remain two problems: 

(t) taking their Fourier transforms to get their contributions to P(f) 
and Q(f), and 

(iz) adding these to the first order approximation for W,(f) given by 
equation (90). 

The Fourier transform of (727'®,K?),, follows from equation (98) 
and the Fourier integral theorem. The Fourier transform of (227*6*K?),, 
may be obtained in much the same way and consequently the contribu- 
tion of these terms to P(f) + Q(f) is 


/ dr exp (—twr)[(i2"* ®, K3).y + (12 BK) av] 


= —W,IT(-—f) + T* AUSF) — SQ). 05) 
Consequently their contribution to the right side of 
Wi(f) = 27 Re [P(f) + QQ) + P*(—-f) + O*(-f] + O/'W,) (106) 
[from equation (50)] is 


2°'W (f) Re (T(—fp + F*QIS.G) — Swolfs) + Si(—) — Si(—f))I. 
(107) 


The Fourier transform of the portion D,(r) of (2-*K,K3),, is obtained 
by replacing exp [22rr(p, + --- + p,)] and exp [22mr(p, +--+ + on)] 
in equation (103) for D,(7) by 6G — pi — +++ — pz) and 6(f — py — 

-+ — op), respectively. The Fourier transform of D,(r), from 
i —27'K*K?),, , can be obtained similarly. The sum of these two Fourier 
transforms gives the contribution of D,(r) + D.(r) to P(f) + Q(f). 
Changing the signs of f and the variables of integration p,, °°: , om; 
and then taking the conjugate complex, gives the contribution of 
D,(r) + D2(r) to P*(—f) + Q*(—f). When the two contributions are 
added, it is found that the contribution of D,(r) + D,.(7r) to Pf) + 
Q(f) + P*(—f) + Q*(—f) is 


Da [. der * [. aod UI W600) | 


“OU Spi ete pe LOR Pilg. 8724) =) Sal = hi <P] 
“USnCor 0+) = (= )"S3(— ar 5 7) 
— Sroler y +++) 7 (—)"Sio(— pry +++] 
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2 : a dp, - ff dom 





c 1 ee 
3 


+ ft win] i wo) | a(f eee 1 ar es Om) 
ade °° ) o- (—)°°" Sn+m(— p1 77° )] 
. [Sam( ps ree ) = (—)"" Sk,.(— a1 as ‘)] (108) 


where the complete arguments of the S functions are shown in equation 
(103). 

The desired expression corresponding to equation (106) for W,(f) 
is obtained by adding the significant terms in the first order approxi- 
mation of equation (90) for W,(f) to the second order terms given by 
equations (107) and (108). The remainder term, O(¢*W,), in equation 
(90) can be ignored because the significant terms are obtained from 
@(¢) — «K() without approximation [compare equations (92) and (93) 
for O(t)]. The result is 


Wf) = 62. Hf) + 4°W of) | TO) + SiG) + T*(—f) + S4H(-D / 
+ 47% > a dp; - ag Op..09.= pesos oo| TL 7 (0) | 


n=2 0 


-| S,(p1 an) Pn) = aC p, ian — Pn) |’ 
-+ expression (107) + 27* Re[expression (108)] + O(@°W,). (109) 


The next section is concerned with the elimination of all O(¢°W,) 
terms from the significant portion of equation (109). When these terms 
are eliminated, the result is the ‘‘main result’ stated in equation (15). 


IX. ELIMINATION OF HIGHER ORDER MODULATION TERMS FROM Wo(f) 


In this section all terms of O&%°W,) in equation (109) for W>(f) 
will be discarded, that is modulation terms of order higher than three 
will be discarded. Since the integral of W,(f) is O(¢’), all terms in 
equation (109) containing the product of four or more W,’s may be 
dropped immediately. 

First consider the terms which explicitly contain the product of three 
Wy’s. This corresponds to n = 3 in the single series in expressions (108) 
and (109), and to the pairs of values n = 1, m = 2;n = 2,n = lin 
the double series. The contribution of the double series can be dis- 
carded because it is O(f [ Wy’) = O@'W,), the functions S,, and 
Si. being O(v’) [from S(f) = 0, S,(f) = O(y’) and Appendix C]. The 
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= 3 term in expression (108) can also be discarded because, from 
Appendix C, 8; — S3. is Of’). Using 8S; — S = O(¢’) in the n = 3 
term in equation (109) shows that the contribution to W,(f) of the 
terms which explicitly contain the product of three W,,’s is 


a) ; dp [ : do W,(p)W (0) W 0) 


-| S(p, oO, v) + S*(—p, Ss, =) |? i O~'W,) (110) 

wherev =f —p-—co. 
Next consider terms which explicitly contain the product of two 
W,’s, namely the terms n = 2 in the single series and n = m = 1 in 
the double series. When we put p,; = p, p.2 = f — p = v in the single 


series terms, and p, = p, o, = f — p = v in the double series term, all 
of the integrands contain the factor 


B* = [S¥(o, ») — S2(—p, —)] 
and the contribution of their sum to W,(f) can be written as 
5 [de WOW.O) Re (88 + 29)] 
where 6 is O(1), and y is not the earlier y(u). Here 
xy = S2(0, ») — Sao(p, 3) — Sir(o; ») — S3(—p, —») 
Sip, 8) re Sale; 2) 
is O(v’) since both S, — Soo and S;, are O(y’). Furthermore, 
Re [6*(6 + 2y)] = |B +7) —yw* = |B +7) + Of”), 
B++7 = T(o,») — T*(—p, —»), (111) 
T(p, v) = 282(p, v) — Seo(p, ¥;) — Silo, »). 


The equations at the end of Appendix C may be used to show that 
T(p, v) is equal to T(p, v) + O(¢*) where 


l 


Tp, ») = S(o, ») + zl “do We()L-38(0, —0, ps2) 


+ 2S(c, —o)S(p, ») + SG, p)S(—a;»)] (112) 


and consequently the contribution to W,(f) of the terms which explicitly 
contain the product of two W,’s.is |) - 
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Lo 2 6 ‘ 
5 | do WW.) | To, ») — TH—p, —») + OW.) (113) 
where v = f — p. 


Now consider the terms in W,(f) which are multiplied by W,(f). From 
equations (109), (107), and the term for x = 1 in the single series in 
equation (108), the sum of these terms is W,(f) multiplied by 


4 SG) Pa: ep) Sep) fF 
2" Re [(—f) + FAQS) — Su) + Sip — Sif) 
+ 27 Re [S4Q) + Si(-PILS:Q) — Silf;) + Si(—-f) — Si(—-f2)] 
= 47 |a +6 |? + 27 Re (a* + B*)(y) 
4"[(a + B)(a* + B*) + @* + B*)y + @ + By* + 17* — 1] 
47% |a+6+y7)? — 4 y7* 
= 47 | Of) + U*(-f P -— 4 y7*. (114) 
Here, with 6 and y different from those in equation (111), 


a= PQ) T=) = 00) 
B= SQ) + Si(—-f) = 0%) (115) 
y= Sif) — Silf;) + SU—-/ — Si(—-f) = O¢') 

O(f) = Tf) + 28,(f) — Sro(f3)- 


The equations at the end of Appendix C may be used to show that 
U(f) is equal to U(f) + O(v*) where 


uN=T0-5] doW.OSl, 0.) + [de | do WW.) 


‘[sS(p, o, ~Pp;, ~G, f) =4 4S(p, — Py) )S(o, —o) 
— 2S(o, NS, —2, —p) — 38(p, 0, f)S(—p, —o)]. — (116) 


Since yy* is O(*), the terms in W,(f) which are multiplied by W,(f) 
can be written as 


WO) |UG) + U*(—-f/ + O@'W,). (117) 


Finally consider the de spike, 63, 6(f), in W,(f). From equation (14), 
the de component of 4@(£) is 


9a¢ = Im (K(t) ian 27° K* ())av + O¢"*). (118) 
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The value of (K(E))., is given by equation (70) and from equation (78) 
with r = 0, 


(K*(D)ue = (K(ONe — [dp W)S(— S00) 


+ ; [- dp iz doa W .(p) W .(c) S2(— p, —a)S2(p, c) ae Oy"). 


Since S,(p) is O(’), the single integral is O(v°) and may be included in 
the remainder term. Squaring the leading term in equation (70) to get 
(K())?, combining terms, and using S2(p, ¢) = S(p, ¢) + O(’) 
leads to 


(K(t) — 2°) = D, 
p= -3/ ; dp Wle)S(, —0) + f : de | ; da W 4(p)IV g(a) 


‘[gS(o, ¢, —p, —2) — 3S(p, —p)S(o, —2) — 28(p, «)S(—p, —9)] 
+ O(¢’). (119) 


The imaginary part of D, gives 62. . 
Addition of equations (110), (113), and (117) shows that 


Wolf) = G2. Of) + Wo(f) | UA) + U*(—f) |’ 


- : ‘es dp W.(o)W.(f — p) | Tp, f — p) — T*(—p, —f + p)|’ 


a 34 it : dp : do W,(p)W (0) W 0) 


=| SQ. e52) FS =p, =e; =p) | OG W,) (120) 
where v = f — p — a. This is the same as equation (15) in the state- 
ment of results. However, the expressions for D,, T(p, f — p), and U(f) 
given in Section III are simpler than the ones given in this section. The 


method of obtaining the simpler expressions will be outlined in Sec- 
tion X. 


X. SIMPLIFIED EXPRESSIONS FOR 6g,, U(f), AND T'(p, v) 


The expressions obtained for 64,, U(f), and T'(p, v) in Section TX 
may be put in forms better suited to calculation by writing the higher 
order S functions in terms of S functions of two arguments, 


S(p,) = T(p +4) — T(o)T(). (121) 
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These simplified forms are the ones stated in equations (16), (17), and 
(18). 

Since no really satisfactory procedure of reduction was found, the 
expressions given here may not be the simplest. The procedure is il- 
lustrated for the double integral 


r= [dof do WW.) 


-[S(p, —~P, 9; —o) = S(p, —p)S(o, —o) = 2S(p, o)S(—p, —o)] 


which appears in equation (119) for 6,, = Im D,. 
After some cancellation, the general equation (159) for S(p, o, v, p) 
shown in Appendix B gives 


S(p, —p, ¢, —7) = 1 — ()(—p) — (¢)(—2) + (6 + 2)(—e)(-2) 
++ (o — o)(—p)(c) + (—p + 2)(e)(—o) 
+ (—p — @)(p)(c) — 3(6)(—p) (2) (—2). 


Here I'(x) has been written as (x) and (0) = I'(0) = 1 has been used. 
When this expression is multiplied by W,(p)W,(c) and integrated 
with respect to p and oc, changes in the variables of integration show that 
the value of J is unchanged by the substitution 


S(p, —P; 9; —c) l= 2(p)(—p) = A(p “F a)(—p)(—a) 
— 3(p)(—p)(o)(—o). 


Here the arrow means ‘‘may be replaced in the double integral by’’. 
Similarly, 


—S(p, —p)S(e, —2) = —[1 — ()(—p)][1 — (0)(-0)] 
—> —1 + 2(p)(—p) — ()(—p)(o)(—o) 
—28(p, 0)S(—p, —c) = —2[(p + 2) — (o)()][(—p — 2) — (—p)(—0)] 
> —2(p + o)(—p — 0) + A(p + 0)(—p)(—0) — 2(6)(—p)()(—0). 


Addition shows that the quantity within the square brackets in the 
integrand of IJ may be replaced by 


8(p + o)(—p)(—a) — 2p + o)(—p — 0) — 6(p)(—p)(o)(—o) 
6(—p)(—o)[(p + 2) — (o)(0)] — 2( + o){(—p — «) — (—p)(—9)] 
6(—p)(—2) S(p, «) — 2(p + «)S(—p, —0) 
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il 


6[(—p — 2) — (—p— 2) + (—p)(—2)]S(p, 2) — 2(p + 2) S(—p, —2) 
6(—p — «)S(p, c) — 6S(—p, —2)S(p, 2) — 2(p + «)S(—p, —2) 
ar A(p “ o)S(— p, —a) = 6S(—p, —«a)S(p, 0). 


Hence 


T= [dp [de WNW Lo) S(—p, —W)I4T(6 + 0) — 680, 2] 


which is the form of I used in equation (16) for D (except that p and 
o are interchanged). 

The simplification of equations (112) and (116) for T(p, v) and 
U(f), respectively, proceeds along the same lines. In dealing with 
U(f), the symbolic substitution 


S(p, o, —p, —o, f) > [1 + (p)(—p) — 2y’(—p)] 
[1 + (¢)(—o) — 2y7(—o) ly’ — (I 


was found helpful. 
In addition to the simplified forms for T(p, v) and U(f) given in 
equations (17) and (18), we also have 


S(p, Oo, v) = S(p + 0, v) == T'(p) So, v) = T'(c)S(p, v). (122) 
XI. THE “SMALL AND SLOW” DEVIATION APPROXIMATION TO Wo(f) 


This section and the following one are concerned with approxima- 
tions to Ws(f) which are obtained by replacing the I’s used in equa- 
tions (15) to (20) by the first few terms in their power series expan- 
sions. These expansions are assumed to exist and to converge rapidly 
over the range of frequencies for which W,(f) is effectively different 
from zero. Roughly speaking, the top baseband frequency is assumed 
to be small compared with the filter bandwidth. 

When the top baseband frequency is small, the modulating fre- 
quency, ¢’(t), changes slowly and we have the quasistatic case. The 
name “small and slow deviation approximation” is used because (15) 
holds only for “small” rms frequency deviations (D small), and here 
the requirement of “slowness” is added. 

Two series which play important roles are 


I 
_ 


Tf) = YLaf/n!) — a (123) 


s iMs 


l 
oO 


InT() = DE Mf*/n!, No (124) 
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The first one is the series assumed for I'(f). Substituting (123) in 
(124), expanding the logarithm, and equating coefficients of powers 
of f leads to expressions for the \’s in terms of the a’s: 


N=, 

Ae = A — ai, 

3 = @3 — Bara, + 203 , (125) 
he = as — 4030, — Baz + 120,07 — bai ; 


As = as — Saya, — lOasao + 20az0;, + 30a2a, — 60a.0% + 240% . 


and so on. When the a,’s are the moments of a probability distribu- 
tion, the X,’s are the associated “cumulants” or semi-invariants. In 
our problem the a,’s are proportional to the moments of the normal- 
ized response y(t), a relation which follows when the series (123) for 
T(f) is compared with the one obtained by expanding exp (—72xft) in 
the Fourier integral (4) for T(f). 

The small and slow deviation approximation obtained from (15) 
and the first few terms of (123) is 


Wolf) > Ge 5(f) 
+ WM +P {Oke + 27D 3; + 87 Ds,)? + (G2, + 27D? A,) }) 


+27 + 27D)? [de WOW — oS — 0 


+ 600° [dp [de Wy(o)W (Wolf — 0 — a)p'o%(f — p : a)? 
an “re 126) 


Here the a,,’s, An’s are the imaginary parts of the coefficients in the 
series (123), (124), D? = ([¢’(t)/(2r)]*), D is the rms frequency 
deviation in Hz, and a2,, A, denote the real parts of a2, A where 


A = a, — 2a;0, — a; + 2a ai. 


The detailed derivation of equation (126) from equation (15) for 
W.(f) makes use of equation (162) which gives S(a,, x2, --- 2,) for 
small values of the x’s. The leading term in equation (162) gives 


S(p, 7) — pors 
S(p, 0, v) > povrs 
—S(o, —2, p, rv) + So, —o)S(p, v) + 28(c, p)S(—a, ») > o’ pvr. 
(127) 
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where the left side of the last equation is proportional to the integrand 
in equation (112) for T'(p, v). There is a similar equation which shows 
that the integrand in the double integral in equation (116) for U(f) 
tends to a quantity proportional to p’o’f\;. To deal with the single 
integral in U(f) we use both terms in equation (162) to obtain 


S(p, —p, f) > —p'fry — 2°'p"fPA. (128) 


Combining equations (127), (128), and the first three terms in the 
series for I'(f) gives 


S(p, O,; v) — pavrs 
T(p, ) > pr(de + 27 D?d,) 
U(f) 31+ (ey + 27!D*dy + 8D) f + (Q7ay + 47D? Af’. 


Substitution in the small deviation approximation (15) for W,(f) then 
gives the small and slow deviation approximation shown in equation 
(126). 

A form of the small and slow approximation which is more complete 
than (126) may be obtained by starting with the quasistatic form of 
equation (7) for O(¢) instead of from the small deviation approximation 
(15) for W,(t). In the quasistatic case the instantaneous frequency 
Q = wo + ¢’() changes slowly and hence rms ¢’’(é) is small. This leads 
us to replace ¢(é — u) in (7) by the equivalent expression ¢(t) — uy’ (t) + 
27*u?o'"(~) where & lies between t — wu and ¢. Let F denote the filter 
bandwidth and suppose that the impulse response y(u) is effectively 0 
outside an interval of length 1/F. Then, heuristically, the integral in (7) 
is given by 


i y(u) exp [¢e(¢ — u)] du = (1 + O(2-'F-? rms ¢'’)] 


[veo exp lie(®) — éue!(0) du. 


The integral on the right is the desired quasistatic approximation. It is 
almost equal to the integral on the left when 27-'F~* rms ¢”’ « 1. How- 
ever, for small rms frequency deviations, the contribution of ug’ (t) may 
be less than the term 2-’F~? rms ¢” even though 


(t) the latter may be < 1, and 

(iz) despite the fact that when ¢(¢) is band-limited with top frequency 
B we always have rms g”’ S (278) rms g’. Therefore, in order to make 
the quasistatic approximation meaningful for small (as well as large) 
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deviations, we impose the additional restriction 2° F~? rms ¢”/(F* rms 
g’) < 1. Then, if 

rms 9 /(2F’) <1, rms ¢”"/(2F rms ¢’) <1 


the Fourier transform (4) gives the quasistatic approximations 
[100 exp Gio(t — w)] du a Pie’()/@n)) exp [iol] 


O(t) = g(t) — 7 In Tye’()/2z)] 
which are equivalent to the usual quasistatic approximation for the 
filter output, namely 
So(t) & GQ) exp [twot + zv(F)]. (130) 


D. T. Hess’* has given a rigorous bound, roughly equivalent to rms 
g'/(2F") <1, for the error in (130). 

For the flat FM baseband case discussed in Section 3.3 the above 
restrictions go into 


(129) 


10 DB/F? «1, 2B/F<«1 


where D is the rms frequency deviation in Hz and B is the top baseband 
frequency in Hz. Notice that although the term “quasistatic” implies 
that rms ¢”’ tends to 0 in some sense or other, the requirements that the 
quasistatic approximations (129) and (130) hold differ from the require- 
ment that the deviation ratio be large, a condition used in calculating 
the quasistatic approximation to the power spectrum of cos [wot + ¢(t)]"* 
Thus, for the flat baseband case, the deviation ratio can be taken to be 
D/B, and this does not have to be large for (129) and (1380) to hold. 

To continue with the derivation of the more complete form of (126), 
we substitute the series (124) for In I'(f) in (129) and take the real part. 
This gives 


a(t) = elt) + BY) 
B(t) = yD deale'(t)/(2n)]"/n. 


Since B(t) depends only on ¢’(é), the power spectrum of y(t) + B(#) is 
W.(f) + Wa(f) (Ref. 15). The covariance of B(é) is 


(BB) = 32 dwaduileltes" 2m)" (a! ml) (131) 


n=1 m=1 


where subscripts 1 and 2 denote arguments ¢ and ¢ + 7, respectively. 
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From the characteristic function of the joint gaussian distribution 
function of ¢/ , of we have 


(te) aoe 2 [~ du jin dv exp [— y(ur ae oe feo yur] 


n! m! U v ay 





- aet= ama + (-)*"4J 2 Nene 
= Dag 40 [(n — 2 + 1T[(m —H27 py aye) (¥) 


(132) 
where ¥, = (v/¢5), Wo = (2rD)’, and we have used, for integer 1, 


(0+) om £ ¢/2 
(2ri)”" / wu *" exp [—y,u’/2] du = : she pie a 





Actually, instead of © the upper limit of summation for k in (132) is the 
smaller of n, m. Also the sum is 0 unless n, m are both even or both odd. 
When 7 is even, & runs over even integers; and when n is odd, k runs over 
odd integers. When (132) is substituted in (131), the Fourier transform 
of the resulting series for (B,B,) gives a series for W,(f) which leads to 
the more complete form of (126) we have been seeking, namely 


co) 1 0 ) eae iD 2 
Wif) >We) + DX El b Danes cD 


-(Q0)~** ie W* exp (—72rfr) dr (138) 


The integral in (1383) can be expressed as a (k — 1)-fold convolution 
of the power spectrum W,.(f) = (2rf)’W,(f) of ¢’(é). This gives the 
first few terms of (126), except for the term (a2, + 2°'D°A,) W,(f) 
which arises from terms neglected by (133). 

Equation (133) is useful only when D is small because the summation 
with respect to n usually diverges. To illustrate this, consider the single 
pole filter for which I'(f) is (1 + 7ffz’)~* and Xd, is (n — 1) '(—7f7)*. 
Equation (1388), with k ee by 2k + 1, gives 


Wl EW + Le oT = Di pb aan o* 





(24)? [- v2"? exp (—712rfr) dr. (134) 


The quasistatic approximation for Wolf) obtained by se: with 
(130) is (seé Ref. 13) 
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Wolf) Wa) + PED [yi exp (—i2efe) ar, 


va) 2 
eke ee 
Ee - | : exp | ; =| ae 


This series, which converges for all D, is of the same form as (134) in 
that it has I,,/D”** in place of the divergent sum with respect to 7. 
When D becomes small, the two expressions for W,(f) approach equality 


in the sense that the sum with respect to 7 is the asymptotic expansion 
of igo 


XII. LIOU’S APPROXIMATION FOR SECOND AND THIRD 
ORDER INTERCHANNEL MODULATION 


It is instructive to relate our main result, equation (15) for Wo(f), 
to an approximation for the interchannel modulation given by Liou.® 
Liou’s approximation is equivalent to taking additional terms in the 
small frequency deviation approximation given in Section XI. 

The interchannel modulation is represented by the portions of 
Wo(f) in equation (15) which contains T(p, f — p) and S(p, o, f — p 
—. oc). Liou’s approximation may be obtained by (2) approximating 
T (p, f — p) by the leading term, namely S(p, f — p), in equation (17) 
and (i) expanding S(p, f — p), S(p, o, f — p — a) in powers of 
p, o, and f out to and including degree 4. This leads to 


T(o, f — p) © Sle, f — op) = TY) — Toe)TE — p) 
_ e(f ae p) [Az = fle i fl, i o(f ™~ p)ts] a Of’) 
S(p,o,f — p— 0) = po(f — p — ao), + fh] + Of) (135) 


where 


lI 
Ms 


Mf) 


anf’ /n!, a = 1 


tt 
° 


he = — ai, dz = a3 — Benq, + 2aj 
f, = (a, — 2asay — a3 + 2a,a0%)/2 

f, = (a3 — a0)/2 

£3 = (a4 — a30)/6 

le = 4°"(ag -- 0) — 3°*(a4 — aga). 


Equation (162) of Appendix B gives the approximation for S(p, ¢, 


(136) 
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f — p — o) shown in equation (135). It does not give the higher order 
terms in S(p, f — p) shown in equation (135). These must be cal- 
culated from the series for T(f). Although the £’s and X’s used here 
are not precisely the same as those used by Liou, they are of the 
same general character. 

When the expressions [equation (135)] for T'(p, f — p) and S(p, o, 
f — p — a) are used in equation (15) for Wo(f), the second and third 
order interchannel modulation terms are found to be 


5 [do WWE = Aer — 0)" 
. { [re + Fla ++ p(f = peas} “+ TG. 
1 2 2 p2 “ ° 
+o + FG [aw] a 


W(oW(oOW.G — p— op o(f —p—a) (187) 


where the second subscripts r and 7 denote “real part” and “imaginary 
part.” The basic approximation used by Liou [his Eqs. (29) and (30) ] 
may be put in this form by expressing his Fourier transforms as 
convolution integrals and combining terms. 


APPENDIX A 


Power Spectra of Real and Imaginary Parts 
of a Complex Random Process 


Let z (t) be a complex, stationary, ergodic, random process [for ex- 
ample the complex phase angle O(t)] and let x(t), y(t) be its real and 
imaginary parts. We seek convenient expressions for the power spectra 
W.(t), W,@ of x(t) and y(é) when 2(é) is the sum of several correlated 
complex random processes, say a(t), b(#), c(t), --- . For illustration we 
take 


a(t) = a(t) + b@) + ce) (138) 


which corresponds to equation (18) for O(é) with a(é), b(é), and c(é) 
in place of &(é), —7K(t), and 727'K°(t), respectively. 

Denoting functions with arguments ¢, ¢ + 7 by subscripts 1, 2 and 
using relations of the type 


leads to the following expression for the ensemble average (21%2)ay 
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(ty02)av = ((2, + 24) (o + 2%))av/4 
= (212. + 272%) + (e422 + 212%) )av/4 (140) 


= 27" Re ((1z2)av + (2'e2)aw)- 
It is convenient to write (2,22)av aS 
(2122)ev = A(r) + A(—7) (141) 
where 
A(t) = ($(aid2 + Bibs + C12) + aide + Gics + Dico)ev- (142) 


This is suggested when the product 2,2, is multiplied out and terms of 
the type a,b. + b,a, are considered. Thus, if 


(ayb2)av = (a(t)b¢t Si 7) dew = f(r), (143) 
setting ¢ = ¢’ — 7 and making use of stationarity leads to 
(bid2)av = (a(t + 7)b)) w= (a0)OU — 7))av = f(—7) (144) 


and hence to equation (141). | 
Similarly, replacing a,, 6,, c, by a*, b* , c® leads to writing the 
second ensemble average in equation (140) for (x22)ay as 


(2iZ2)ev = Br) + B*(—7) (145) 
where 
B(r) = (3(a%az + bYb, + c¥e2) + a%b, + acs + dics). (146) 
For terms of the type a¥b, + b*a, , the analogues of equation (143) and 
(144) are 
(aibe)av = (a*()O(t + 7))av = f(z), 
(Dids)av = (a(t + 7)b*())aw = (a(t) D*(U’ — 7))aw 
= (a*(t)b(t? — 7))% = f*(—7). 


Comparing equation (13) for O(@) with equation (138) for z(é) sug- 
gests setting a(t) = S(t), b(t) = —7iK(é), and c(t) = 7127°K°(t); this 
leads to equation (49) for A(r), B(r) given in Section IV. 

Equation (140) for the autocovariance (7,%2),, of x(£) now takes the 
form 


(147) 


(t1%2)av = 27" Re [A(r) + A(—1r) + B(r) + B*(—7)], (148) 
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and the power spectrum of 2(é) is 


W.(f) = ie exp (—1wr){1122)av AT, w = 2Qrf. 


This may be written as 


W.(f) = 2" Re [P() + Q(f) + P*(—f) + Q*(—-P)] (149) 


where 


P(f) = p exp (—twr)A(r7) dr, oe 
a 150 


Q() = [exp (ion) BG) dr. 
Equation (149) for W,(f) may be derived from 
2°* Re [A(r) + A(—7)] = 4°"[A(r) + A*(r) + A(—7) + A*(—7)] 
27" Re [B(r) + B*(—7)] = 4"[B(r) + B*(r) + B(—7) + B*(—7)] 


(151) 
and relations of the type 
pap =f : exp (iz) A*(1) dr = f : exp (—tws)A*(—1) dr 
P(-f) = i’ exp (—tw7)A(—17) dr, (152) 


co} 


Pipi i atti aHGiae 


The power spectrum W,(f) of the imaginary part y(t) of z(é) may be 
computed in much the same way, starting with 


(YiYo)av = ((2, — 24)@2 — 2))av/ (27)? 
= 27’ Re Lica ie +- (2520) ae |< 


This differs from equation (140) for (7,72),, only in the sign of (2:22)av - 
Therefore only the signs of A(r) and P(f) need be changed in the earlier 
work, and we get 


Wf) = 2° Re[—P(Q) + Q() — P*(—-f) + O(-/)] (153) 
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APPENDIX B 


The Functions S8(p, 0), S(p, 0, v), °° 
The function 


Se ym, +++ 2) = [ duv(wHaCe)Hlas) «+ Hite.) — (154) 
where 


Hx) = exp (—72rru) — I(x) | (155) 


is a symmetrical function of the x’s. It may be expressed as the sum 
of products of I’s by replacing the H,’s by their definitions, multiply- 
ing out, and using the fact that T'(f) is the Fourier transform of y(t). 
This evaluation of the integral shows that S(x1, v2, ‘+: , tm) iS given 
symbolically by 


S(t, 2, +** )%) = II yt — Te) (156) 


where, after multiplying out, the various powers of y are replaced 
by I’s according to the rule y* > T(z). 
For example, 


S(p) = 0, 
S(p, 0) = [y’ — Toe)]ly’ — T)] 
y’*" — yo) — M(p)y” + P(e) P(e) 
T(p + o) — T()T) 


(157) 


ll 


S(p,o,v) = Mp to+) — Mot ol) — To +») 
— To + »)P(p) + 20(o)T(o)T@). (158) 
For four variables, writing (x) for T(z), 
S(p, 0,7, uw) = (op to tv+u) — (pt+ot+r)(p) — (o+o4+p)@) 
—(p tv + we) — (© tu tro) + (0+ JOH 
+ (p + v)(o)(u) + (op + u)(o)@) + (© + »)(0)(u) 
+ (o + w)(e)) + & + u)(p)(o) — 3(p)(o)(H)%). (159) 


S(x1, 2, *** , Z,) vanishes when one or more of its arguments are 
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zero because H,,(0) is zero. Of interest is the form taken by S(z,, 
* ,%n) when the x’s are small and r'(f) may be expanded as a power 
series in f. Let the power series be 


oe 7 


rip) = yee ay = 1. (160) 


n= 


Since 


ri) = [ya exp Gf) du, & = ie 


= Lf wor du 


n=0 n! 
it follows that 
poe i ase: (161) 
When 2 is small, equation (155) for H,,(x) gives 
Hx) = exp (é&) — T'@) 
xl — a) + 2 *2( — o)] + O@"). 


Il 


Then 


II F(x) = (24% Sanus nd] a a)" a aE i ay) = arn) > n| 


+ O(x"*”) 
and substitution in the integral [equation (154)] defining S(a,, --- , z,) 
leads to 
S(x, qa" j Xe) 
= (ote + am) [ auvtan] © (Mean! 
-3< t=0 
n n-1 — 
a a Ss Lp » ‘ ; es = fay(—ay | te O(x"*?) 
1 =0 


= (140 °° | ("eu 


+27 > Ly 3 % 7 "(au _ way(—a,)* | ES Ow). (162) 


This is the approximation. needed to examine the form taken. by- W,(f) 
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when the bandwidth of y(t) becomes small, as it does in Section XI. 
When I(f) is such that | r(f) — 1| < e « 1 for all values of f, it 


may be shown that the symbolic form of equation (156) for S(a,,°-> , Zn) 
becomes 
S@,---,a) = [Jy - 1) + 0€@) (163) 
k=1 
and the I’s appear only linearly. Furthermore, S(a, , +--+ , %,) is O(e). 
For example 
Sosy =y oy i+ Oe) 


I 


Tip + oc) — Mp) —To) +14+ O(e). 


This result is of interest in connection with the first order approximation 
discussed in Appendix D. 
To establish equation (163) let 


a =y"* —1, e«, = 1 — T(,) 


so that equation (156) for S becomes 


S(hi5 #4 Ba) 


I (Z, + €) 


Ilz + I] Ex IT’ Ze + O(e). 


k=1 


(164) 


ll 


Here the factor z, is omitted from [[’. When the product 


m 


aee ++ 2m = [Ty — 1) 
k=] 
is multiplied out and the y”’s are replaced by I'(u)’s, the result is the 
sum of 2”Q’s [(1 = I°(0)]. Half of the I’s will have plus signs and the 
other half will have minus signs. Adding +1 for each —T and —1 for 
each +I shows that the entire sum is O(e). Hence 2:22 +++ Zm is O(e), 
and when this is used to show that the []’ in equation (164) is O(6), 
the result stated in equation (163) follows. 


APPENDIX C 


The Functions S, and Sym 


The functions S,(4 , 22, -°° , Za) ANG Syn(2y °° y Wn Yr °° y Ym) 
are defined by the integrals, for n 2 land m 2 1, 
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Sily p++ ya) = [du tw) exp fated] T] Hule), 


(165) 
San®s pe a Re Ug 5 Ur) 
= [ad] dw yo)r(w) exp (abo) + alw) + 2e(w, », 0] 
where H,(x), a(u), c(w, v, 0) are given by 
Hx) = exp (—72rau) — T(z) 
aw) = -5 [af WH-DH) (166) 
1 foe} 
ow,» 0) = 5 | df WDHA-NHD) = cb, w, 0) 
[see equation (66)]. For n = 0, Sp is defined as 
8, = | duu) exp fa} (167) 


and for the double subscripts, 


Sno(@1 ptt 9 Xn ;) 
= [a [dw ro)yw) exp fae) + a(w) + 2e(w, v, of] IT Hae) 
Sro(X1 pitty Un 7) = Sonia fen pare) 


Soo(:) = fav [dw yW)r(00) exp (ala) + ate) + 2e(w, , 0]. 
on ei (168) 


The functions S, and S,,, depend upon both W,(f) and ['(f) [through 
H_,,(f)]. This is in contrast with the function S(a,, -:- , %,), defined in 
Appendix B, which depends only on I'(f) and is independent of W,(f). 

The function S, may be expressed as the sum of multiple integrals 
involving the functions S. Expanding exp [a(w)] in powers of a(u) and 
replacing each a(u) by its integral [equation (166)] with p in place of 
the variable of integration f leads to 


[-) _1y\i ie) co 
s -1+ De | do | dp; 
7=1 : —3o —3 
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bie 
1H W Cpe) |S(o1 9° °° 9 Poy “Pry **? 5 —p;) 
k=1 
DaCir Wy ime in) 


= Sle yt, s+ 2) + PE = dp | do, 


{0 W(0) [Slo » + » Pi y Pry °** y TP7 7%, 7" -o,) 
Ra (169) 
where n = 1. 


Similarly, expanding exp [2c(w, v, 0)] in the integrand of the integrals 
defining the S,,, functions leads to 


Soo(;) = 88+ yf a ef ae, 





{U Wo) |S.o p PE? 5 Dy SK = p52? sr py) 
Srol@ °° » Xn ;) 
= SoSn(a1 ott ta) Bs > a dp, ° fe dp; | Wo) | 
S54n(01 > °° y Pi > UM, 0" » En) Si(— pr as — p;) 
Sam(X1 972% 9 ha Yr, °° bl) = Sr(x1 gens ti) Sai aan Um) 
LE aos +f don | TT (00 | 
ae > q! ie dp, in dp; I W (px) 
*Sjan(o1 »°°% » Pp 7%, 7°? {BS seal— Pi 27°59 P77 Yry**? 4 Ym) 
(170) 


when n 2 landm = 1. 

In order to obtain an inequality for S,(%,, --+ , %) assume that (2) 
the termwise integration in the derivation of the series in equation 
(169) is legitimate, and (77) an M > 1 exists such that for all real values 
of f 


M > |TQ)| = 1G + fo)/G@(fo) |. 


Since S(x,, --+ , %,) may be expressed as the sum of 2” terms, each of 
which is a product of not more than 7 of the I’s, 


[Sig +** 42.) | <= 2F, 
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Furthermore, since the integral of W,(f) from — © to + is equal to 
(v"(é))av » the terms of the series in equation (169) for S,(a,, --* , 2n) 
are dominated by the terms in 


ur (yi (2M jit = (2M)” exp [2M oe Vav]- 


Therefore the series in equation (169) converges and 
| SQ, +++ 5 2n) | < (2M)” exp [2M*(y’)). 


This inequality may be used to show that the series in equation (170) 
for S,m converges and that 


| Sam(%1 root yy Uy Yr, tr, Ym) | << (2M)"*" exp [(SI1*<y") wv]. 


The leading terms in the series required to handle the second and third 
order modulation are 


$= 1-5] dp W.0)Sle, -0) 
as ; iz dp is do W (p)W (a) S(p, ¢, —p, —¢) + OY’), (171) 
Si) = 0-5 | de W080, - o,f 
af ; ie dp ie do W.(p)W ,(c)S(p, ¢, —p, —2, f) + O'), (172) 


S:(0,») = S(o,») — 5 | do W.(0) Sle, —2, p,») + Ol"), (173) 
S3(p, Oo, v) a S(p, GO; v) ot Oy’), (174) 
; 1 eo ioe) 
Su) = +5] dof do WLe)W(@)S(0, 0) S(—p, —2) 
+O"), (178) 
Soff) = 1) + [dp [do WW OE" Sle, —0, NS, -0) 


- 2°" S(p, f)S(o, So; —p) ere 27" S(p, 0; f)S(—p, —¢)| a3 Oy’), 
(176) 


Ox ee Re 5 a "fe WSC: =08G) AO. 
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S3o(p, Go, v5) = S3(p, 0, ) - Og’), 
Sule) = — | de W.(e)S(o, 0) S(—2,») + Ol), 


Sei(p, 737) = OY’). 
In obtaining the leading terms in So9(;) and S,o(f;) the leading terms in 


So, S.(f) and S2(p, f) were used. S,(f) is O(~’) in contrast with S, , 
S., S;, °-+ which are O(1). 


APPENDIX D 


Derwation of Earlier First Order Approximation 
by Present Procedure 


The first order approximations which are given in Section VII 
are somewhat more complicated, as well as more accurate, than the 
ones which have appeared in the literature.® * °? Here the relation 
between the earlier and present work will be brought out by applying 
the procedure of Section VII to obtain a first order approximation, 
which is the same as the given in Ref. 10 [the @(t) of Ref. 10 is A(t) 
— y(t) in the present notation]. 

The derivation starts from the initial equations (5) and (7) for 
the filter output so(t), 


So(t) = {exp [—a — 78) + 1O(t)]} exp (wot) (177) 
@O(t) = —7 In {f" du y(u) exp [ie(t — u)]?- 


The difference between the output phase angle 6(t) = Re O() and the 
input phase angle g(t) is assumed to be small, and the filter delay is 
usually taken to be zero at the carrier frequency, that is, Im [dT (f) /df] 
is zero at f = 0. 

Adding and subtracting zy(t) [instead of the linear portion ®(¢) of 
the output] in the exponent appearing in equation (177) for O(@) gives 


Od) = g(t) —iln[1 + kM), (178) 
Ki) = [du ye) {exp belt — w) ~ iol] - 1). 


The first order approximations to the complex phase angle @(¢) and 
the output phase angle @(¢) are now 


Of) Y v(t) — tk) 
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and 
A(t) © elt) + Re [—zk()], (179) 


respectively. 
The analysis of the earlier sections goes through much as before with 
k(¢) in place of K(é), and 


h,(f) = exp (—72rfu) — 1 (180) 
in place of 
H,(f) = exp (—72afu) — T(/). 


An illustration of how h,(f) enters the analysis is furnished by the 
computation of (exp [7e(¢ — u) — ie(t)])... AS suggested by equation 
(56) for g(é — u) — &(t), let 


Ig(t) — g(t — u) — oft) 
e(f) = exp (—72rfu) — 1 = hf) 


5 / ; df W(Nepe— 


(181) 
-5 i oa W PhuNh(—f) = a’(u) 


exp [a’(u)]. 


(exp [tp(¢ — u) — t¢(t) av 

Of most interest in practice is the power spectrum W;(f) of &(d), 
E(t) = Re [—2k(d)] 
A(t) & od) + Ed) 


where £(t) is an approximation to the distortion. The power spectrum 
W,(f) is the Fourier transform of the covariance (££) where, as before, 
subscripts 1, 2 refer to times t, t + 7, respectively. By putting &(8), 
—ik(t) for 6(¢), O(¢) in equation (47), or directly, 


(Eiko)av = 2° Re [—(kyka)av + (hike) avd - (183) 
It may be shown that 


(182) 


(bi)oe = [du rw) exp [a"Ca)] 


(kahs)ou = Chale + [du [ dv-veaye) exp [a'w) + a’Q) 


-{exp [2c’(u, v, 7)] — 1} 
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(ethader = | (der P+ [du [ do-a*y0) exp [a + a’) 


-{exp [—2c’(u, v, 7)] — 1} 
(184) 


where 


ou, 0, 2) = 5 | df W.Dh(—DhW exp (af) 


a’(u) = c’(u, u, 0) (185) 

c'(u,v, —7) = c’(v, u, 7). 
Since h(—f) is equal to h*(f), a’(u) and c’(u, v, 7) are real. Furthermore, 
c’(u,v, r) = —3IR,(r tu —v) —R(r — 0) — Ror + uv) + B(7)] 
a’(u) = —R,(0) + Ryu) (186) 


where f(r) is the covariance (y(¢(é + 7))av of of). 

The expression for (££). obtained by combining equations (183) 
and (184) is similar to equation (8) of Ref. 10. 

The power spectrum of £(t) may be written as 


Wef) = &. a) + 27 Re (PS) + QP) + PX) + OX—P] C187) 
in which £,, is equal to Im (k,),,. and 
PU) = | dr exp [—a2nfr]l—A(kaks aw — a)E], 
a (188) 
Q() = J dr exp (—i2af)B(kiRa)aw — | (h)er [- 


Addition gives 
PO) +.) = | dr exp (—i2efr) [du [av exp [ar + o') 
. — $(u)y@) {exp [2c’(u, v; 7)] 7 1} 


+ 37*@)r0) {exp [—2e'u, », 7] — 1}) (189) 


which, when used in equation (187), leads to an expression for W;(f) 
which is similar to the main result given in Ref. 10 [equation (16) of 
Ref. 10]. 
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The relation c’(u, v, —r) = c’(v, u, rT) may be used to show that Q(f) 
is real and P(f) is even. When y(u) is real, '(—f) = I'*(f), and the 
expression for W;(f) may be simplified. 

Expanding exp [+2c’(u, v, 7)] in powers of c’(u, v, rT) leads to 


Wi) = 8.90 +E DFP do [de 8h = a — + = 
i W (00) | | Si(pr 9705 y Pn) os (= SP (= pi ns) — Pn) |’ 
(190) 


where, as in Appendix C for the unprimed 8’s, 
Sia, ne) Lp) 


= [dur exp (ao) TI fexp (—e2run,) — 1) 
= S’(a,,°°°, o) + -f dp | doy | TT W.(o0 | 


Opie Ppl — Pigs 4 Pp», °°" Ta) 
The series in equation (190) is analogous to the series in equation (90) 
for the more accurate first order approximation based on g(t — u) — 
@(t). The function S’(z,, --- , x,) is the analogue of S(a,, --- , 2x) 
discussed in Appendix B and is defined by 


S’ (2x, ana pea) = i: du y(u) I h(x») 
- 1 yy — 


The second equation is symbolic in that y’ is to bé eapineed by ts 
after expansion of the product. It is shown in Appendix B that when 
| Tf) —1| < ¢ <1 for all real values of f, 


Slay, +++ 5.) = Sy, +++ ym) + OCC). 
APPENDIX E 
Power Spectrum of aé(t) + b In R(é) 

Section III states that an expression for. the power spectrum W,(f) 
of x(t) = a6(t) + b In R(@) may be obtained from equation (15) for 
W,(f) by-replacing U(f), T(e,f—p), and S(p,-0, f—p—o) by (a+4b)U(f), 
(a + 2b) T(p, f — p), and.(a-+ 1b) S(, o, f-— p-—_2),. sta sag Here 
the steps leading to this result are outlined. 
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From equation (6) for the complex phase angle O(¢) we have 
10(t) = In RG) + 200), 
and it follows that, for arbitrary real values of a and b, 
af + bIn Ri) = Re [(a + 20)O0)]. 


Consequently W,(f) may be obtained by replacing O(é) by (a + 7b)O() 
in the analysis which led to equation (15) for W,(f). 

The functions A(r) and B(r) appearing in equation (48) are replaced 
by (a + 7b)’A(r) and |a + 7b|* B(r), and their respective Fourier 
transforms P(f) and Q(f) are replaced by (a+7b)’P(f) and | a+b |? Q(f). 
Each factor in (a + 7b)? = (a + ib)(a + 7b) can be associated with 
factors in P(f), and each factor in (a + 7b)(a + 72b)* with factors in 
Q(f), in such a way that U(f) becomes multiplied by (a + 7b) and 
U*(—f) by (a + 7b)*, and so on. This may be verified by repeating the 
analysis of Sections VIII and IX with the modified expressions. 


APPENDIX F 
Results Obtained from the Series for In[1 + K(t)] may be Asymptotic 


For gaussian ¢(é) with average 0 and rms value ¢, the following con- 
siderations suggest that results obtained from equations (10) and (12), 
namely 


O(d) 


#() —iIn{l + K(d] (191) 


Il 


H(t) +1 Yn [-KOP 


represent the first few terms of an asymptotic series when o — 0. 
Since K(t) is difficult to handle, we replace it by ala**‘“ — 1] where a 
is somewhat like the integral of y(u) between —o and «. The value 
of this integral is 1, and we regard a as being near 1. The series for 
In [1 + K()] behaves somewhat like the series 


In (1 + afexp [4e(0] — 1)) = — 2% fexp fret] — 1)" (192) 


in which the mean square value of the tie of the nth term is 





* exp [- e/2e")] no 2n 
- on)! [2a sin 9/2] de. (193) 
When o < 1 and n is not too large, most of the contribution to the 
integral (193) arises from the region around ¢ = Q, and the integral is 
approximately 
» (2n — 1)(ac)*"/n’. 
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Consequently, the first few terms decrease rapidly when o < l. 
However, when n is very large most of the contribution arises from the 
regions around g = +m, +32, --- , where [sin (y/2)]”” is a narrow pulse 
of height 1 and area 2(7/n)*. When « < 1 only the regions around 
g = 7a are important and the integral (193) is approximately 


o 2n*/?-°? 2a)" exp [— 2° /(20°)] 


which tends to » when ais near landn— o. 

The fact that the rms values of the terms of the series in equation 
(192) decrease rapidly at first and then increase without limit suggests 
that results attained from the somewhat similar series in equation (191) 
may be asymptotic in nature as o —> 0. 
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Frequency Modulation of a Millimeter-wave 
IMPATT Diode Oscillator and Related 


Harmonic Generation Effects 


By T. P. LEE and R. D. STANDLEY 
(Manuscript received May 31, 1968)* 


In this paper we report the performance of a continuous wave millimeter- 
wave IMPATT diode oscillator with a wide-band tunability. The diode 
ts mounted in an iris wafer circuit; its oscillation frequency can be modulated 
either by a varactor diode or by direct modulation of the IMPATT diode 
beas current. The oscillator has been successfully used as a millimeter-wave 
frequency deviator in an experimental pulse code modulation millimeter- 
wave system. 

We also report detailed measurements on subharmonic frequencies in 
IMPATT diode oscillators. Experimental results show that wide frequency 
tunability can be obtained with a circuit which provides an ‘“idler’’ resonance 
at one-half the fundamental transit-time frequency. The results also show 
that by providing ‘idler’ resonances at both the transit-time frequency and 
at one-half of the transit-time, the oscillation at 3 the transit-tume frequency 
ts enhanced and yields a useful output power of 2 mW at 86 GHz. 


I. INTRODUCTION 


A continuous wave (CW) millimeter-wave silicon IMPATT diode 
oscillator was used satisfactorily as a local oscillator in an experi- 
mental pulse code modulation (PCM) millimeter-wave repeater sys- 
tem.” ? That oscillator circuit used a radial-line resonant cavity whose 
resonant frequency was the primary factor determining the oscilla- 
tion frequency. The oscillator was difficult to tune either mechanically 
or electronically. 

While such characteristics are desirable for fixed frequency local 
oscillator applications, other applications which demand wideband 


* The essential results of this paper were presented at the 26th annual con- 
ference on Electron Device Research, Boulder, Colorado, June 19-21, 1968. 
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performance and tunability are also of interest. For example, a mil- 
limeter-wave frequency deviator using an IMPATT diode would be 
advantageous over the L-band deviator and up-converter combina- 
tion in the PCM repeater system. The IMPATT diode oscillator 
could deliver power at least one order of magnitude greater than the 
up-converter. 

A previous paper showed that the circuit inductance of the radial- 
line cavity in shunt with the diode was much smaller than the diode 
inductance.t Therefore, the oscillation frequency could be tuned only 
a few hundred megahertz by varying the diode bias current.* In ad- 
dition, the radial-line cavity was loosely coupled to the external wave- 
guide circuit so that a tuning range of only 300 MHz was obtained 
when a varactor diode was used for frequency deviation. To improve 
the tunability, it is necessary to have (i) the IMPATT diode equi- 
valent inductance dominate that contributed by other circuit elements, 
and (22) the diode closely coupled to controllable external circuit 
parameters. 

It is important to notice that the tunability can be further improved 
by providing more than one resonant circuit for the oscillator. The 
frequencies are harmonically related, and oscillation at each of these 
frequencies is reactively terminated except the one for the output. 
Take as an example the two-frequency case. The tunability of the 
oscillator near the transit-time frequency (say 60 GHz) can be im- 
proved if a lossless resonant circuit at half the output frequency (30 
GHz) is incorporated in the oscillator circuit. The oscillation at 30 
GHz is terminated by a cut-off waveguide and therefore is designated 
as the “idler,” an analog to the idler in harmonic generators using 
varactor diodes. It is not necessary that the low frequency be used as 
the idler. Swan has shown that by providing an idler resonance at 
twice the transit-time frequency, improvements in both the power 
output and the tunability can be obtained.® 

To achieve wide band tunability, IMPATT oscillators were designed 
using resonant-iris structures. It can be shown that the diode equi- 
valent inductance is dominating in this circuit compared with that of 
the radial-line cavity structure. The loaded Q of the iris is about 10 
which provides a wide bandwidth for oscillation. The oscillation fre- 
quency could be tuned over 9 GHz in the 50 to 60 GHz range by vary- 
ing the bias current of the IMPATT diode, thus varying the diode 


*The diode equivalent inductance is inversely proportional to the diode 
bias current. 
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equivalent inductance. It could also be tuned over 3 GHz by using a 
varactor diode closely coupled to the IMPATT diode. 

Notice that by using the fundamental transit-time frequency as the 
idler, the power output at one-half the transit-time frequency is, in 
general, higher than the power output when the half frequency is used 
as the idler in our particular circuit. We refer to the frequency which 
is one-half the transit-time frequency as the subharmonic frequency 
throughout this paper. 

In the sections which follow we describe the circuit structure of the 
oscillator, the performance as a tunable oscillator, and the results of 
frequency modulation of the oscillator in an experimental millimeter- 
wave PCM repeater system. Then measurements of various harmonic 
frequencies existing in the oscillator circuit, and the identification 
and effect of the subharmonic oscillation are described in detail. 


II. DESCRIPTION OF THE OSCILLATOR 


Figure la shows the resonant-iris structure. The iris is made in 
wafer form similar to the Sharpless wafer.* The range of the oscilla- 
tion frequency and the Q of the resonant iris are determined by the 
size, thickness, and shape of the iris aperture. Refs. 5, 6, and 7 give 
details of iris characteristics. The wafers used were 0.100 inch thick, 
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Fig. 1— Artist’s view of the oscillator assembly. (a) IMPATT diode wafer. 
(b) Oscillator mount. 
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and had a rectangular aperture 0.010 or 0.030 inch high and 0.100 to 
0.148 inch wide. 

The IMPATT diodes had a 0.001 inch diameter mesa structure 
and nearly abrupt junctions as reported by Misawa.® Typical diode 
characteristics are given in Table I. The diodes were thermal-com- 


TABLE I — TyprcaL DiopE CHARACTERISTICS* 


Epitaxial layer thickness 2.1ly 

Epitaxial layer doping densityt 6 X 10'* cm73 
Junction deptht 1.04 

Space charge layer widtht 0.75 p 
Breakdown voltage (at 1 A) 19V 
Capacitance at breakdown 0.19 pF 
Junction area A 1.5 X 10° cm? 


* All diodes were from the LO 1114 series. 
+ Measured by staining and interference fringe. 
t Obtained from voltage dependence of capacitance. 


pression bonded onto gold-plated copper studs 0.063 inch in diameter. 
After bonding, the diodes were coated with a thin layer of silicone 
varnish which enchances the mechanical strength of the mesa struc- 
ture. Electrical contact to the diode was made by a 0.032-inch nickel 
rod with a welded contact spring, as pictured in Fig. la. For the 
varactor-tuned oscillator, a varactor diode in a Sharpless wafer was 
mounted adjacent to the iris wafer as shown in Fig. 1b. The relative 
coupling between the IMPATT diode and the varactor diode (thus 
the tuning characteristic) was adjusted by varying the relative posi- 
tion of the two diodes in the plane of the cross section of the wave- 
guide. The complete assembly in a RG-98/U waveguide is shown in 
Fig. 1b. 


III. OSCILLATOR PERFORMANCE 


Figure 2 shows the typical performance of an oscillator without the 
varactor. (The varactor wafer is replaced by a blank wafer.) The 
iris used had an aperture of 0.130 by 0.030 inch. The CW ouput 
power (above 1 mW) and the frequency are plotted as a function of 
the IMPATT diode current. Oscillation began at a diode current of 
about 50 mA compared with 100 mA for the cavity structure.t The 
output power was optimized at each current level by adjusting an 
E-H tuner in front of the diode and a sliding short in back of each 


IMPATT FREQUENCY MODULATION 147 


FREQUENCY IN GHZ 
OUTPUT POWER IN dBm 











oO 
60 80 100 120 140 160 180 200 220 
CURRENT IN MA 


Fig. 2— Maximum output power and frequency as a function of diode current. 


diode. Note that the tunable band (about 10 GHz) is considerably 
wider than that previously reported for a radial line cavity structure. 
By varying the bias current with fixed mechanical tuning, the fre- 
quency could be tuned over 4 GHz with a 3 dB variation in the out- 
put power, and over 9 GHz with a minimum output power of 1 mW. 

Table II summarizes the results of various circuits with different 
iris apertures. Although there are not enough data to draw a meaning- 
ful conclusion, it seems that the oscillation frequency was relatively 
independent of the iris width compared with iris height, which im- 
plies that the inductance of the center post (with contacting spring) 
partially controls the oscillation frequency. This statement is based 
on the assumption that diodes used in the test are of uniform char- 
acteristics since they were made from a single slice and batch pro- 
cessed. The de characteristics of the diodes showed less than 5 per- 
cent variation in capacitance and in breakdown voltage. 


IV. FREQUENCY MODULATION 


As mentioned in Section I, one goal in this work was to design an 
IMPATT diode oscillator circuit which could be frequency modulated 
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TABLE II—REsuuts oF CIRCUITS WITH 
VARIOUS DIODES AND IRIS APERTURES 


Tunable{ 
Width Height Cpt bandwidth 
Diode No. (in) (in) (pF) (GHz) Pmezx (dBm) 
1 148 0.030 0.19 50 — 60 15 
2 130 0.030 0.19 53 — 63 16.5 
3 120 0.030 0.19 52 — 59 11.5 
4 110 0.030 0.18 56 — 63.2 9.4 
5 100 0.030 0.19 53 — 61 10.7 
6 130 0.010 0.16 64 — 70 11.5 
7 148 0.010 0.18 58 — 65 11.1 
t Cz is the diode capacitance measured at breakdown (Vz = —19 volts). 


{| The lower limit on frequency is arbitrarily chosen as the frequency for which 
the power is 1 mW. The upper frequency limit occurs at a bias current of 160 mA 
which is well below the burn-out level for the LO 1114 series diodes. 


and used as a frequency deviator in a PCM millimeter-wave repeater 
system.? Two approaches were taken. The first used a varactor diode 
to tune the circuit susceptance outside the diode; the second used 
the fact that the diode inductance (from avalanching) varies in- 
versely with the bias current. 

Both methods have advantages and disadvantages. The power out- 
put of the varactor-tuned oscillator remains almost constant over 
the frequency band so that amplitude modulation is negligible. How- 
ever, to achieve ultimate performance the circuit is much more com- 
plex. The bias-current tuned oscillator has simpler circuitry, but 
inevitably its power output varies with bias current, which results 
in AM distortion. However, the AM distortion can be overcome by 
proper tuning of the circuit if the modulation index is small. (See 
Section 4.2) 


4.1 The Varactor-Tuned FM Deviator 


The varactor tuned oscillator is shown in Fig. 1b. The varactor 
diodes used were planar diffused GaAs diodes with a honeycomb 
structure.2° The zero-bias capacitance was 0.04 — 0.05 pF and the 
breakdown voltage was 20 volts. The capacitance varied with volt- 
age approximately as C = Cy (1 + V/y)~°-4, where Co is the junction 
capacitance at zero bias, V is the bias voltage, and ¢ is “built-in” 
voltage which is approximately 1 volt for GaAs. The diode was then 
mounted in a Sharpless wafer which in turn was inserted into the 
oscillator mount. The varactor diode was about 0.080 inch behind the 
IMPATT diode. The coupling between the varactor diode and the 
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IMPATT diode was tuned by sliding the varactor wafer relative to 
the IMPATT wafer. Additional tuning was provided by a sliding short 
behind the varactor diode and an E-H tuner in front of the IMPATT 
diode. 

The self-resonant frequency of the varactor diode, as measured by 
the transmission resonance technique, was used to correlate with the 
tuning sensitivity of the oscillator. The self-resonant frequency can 
be varied by changing the length of the contacting wire in the varac- 
tor wafer. It was found experimentally that the best tuning sensi- 
tivity was obtained when the varactor diode was self-resonant near 
the ‘‘idler” frequency. 

A typical frequency tuning characteristic of the oscillator is shown 
in Fig. 3 where the frequency is plotted as a function of the de bias 
voltage (reverse biased) on the varactor. A tunable band of 2.5 GHz 
was obtained. Notice that the linear region was 1.5 GHz with a sen- 
sitivity of 1 GHz per volt. Also notice that the power output varied +1 
dB over the tunable band. 

Frequency modulation with sinusoidal drive on the varactor was 
achieved at both the 160 MHz and the 500 MHz modulation fre- 
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Fig. 3— Performance of the varactor-tuned oscillator. 
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quency. Figure 4 shows a series of the FM spectrum at 160 MHz 
modulation as the modulating power is increased. For the particular 
diode tested, the minimum modulating power required at 160 MHz 
for complete carrier suppression was about 4 mW (to a 50 ohm load). 


4.2 The Bias-Current Tuned FM Deviator 

The second method of frequency modulation was achieved by di- 
rect modulation of the IMPATT diode bias current (the varactor 
diode wafer was replaced by a blank wafer). The modulating signal 
was directly applied to the bias through a coupling capacitor. The 
de tuning sensitivity was between 100 to 200 MHz per milliampere at 
optimum tuning conditions; this resulted in a decrease of 1 dB in 
output power compared with the maximum power obtained without 
frequency deviation. For a sinusoidal drive at 160 MHz, complete 
carrier suppression was achieved at 0.25 mW drive power. Figure 5 
shows the spectrum of the modulated signal at various drive levels. 
When it is tuned properly, no appreciable AM distortion was seen at 
low modulation index. 


4.3 System Performance with the IMPATT Diode Oscillator-Deviator 

The methods of IMPATT diode oscillator frequency deviation were 
tested in a two-level PCM millimeter-wave repeater system.? The 
error rate of the system with the IMPATT diode oscillator used as 
the deviator was measured as a function of the signal-to-noise ratio. 
Figure 6 is a simplified block-diagram of the test circuit. The IM- 
PATT diode oscillator-deviator is driven by a random-word genera- 
tor either through the bias of the varactor for varactor-tuned devia- 
tion or through the bias of the IMPATT diode for current-tuned 
deviation. 

The random word generator produces 160 megabit pulses per sec- 
ond with a random distribution in polarity. These pulses modulate 
either the IMPATT diode current or the varactor diode bias to deviate 
the millimeter-wave output frequency according to the input pulse 
polarity. The IMPATT diode oscillator-deviator output is down con- 
verted to an IF signal in the 1.8 + 0.25 GHz band. The IF signal is 
then amplified and injected through a 500 MHz bandwidth filter to a 
phase-locked oscillator. The function of this oscillator is to act as a 
limiter thus removing amplitude modulation from the frequency 
modulated signal. The output of the phase-locked oscillator is further 
amplified before being injected into the differential phase detector and 
timing recovery circuit. The latter gives two output voltages. The 
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Fig. 4—FM spectrum at 160 MHz modulation of varactor-tuned oscillator. 
Complete carrier suppression is shown in (d) with 6 dBm IF power. Others are 
(a) —9 dBm, (b) —4 dBm, (c) 1 dBm, and (e) 11 dBm (first sideband sup- 
pression). 
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Fig. 5— FM spectrum at 160 MHz modulation on the bias current. (a) IF 
power of —11 dBm for equal sidebands and carrier, Af/f = 1.485, (b) IF power 
of —6 dBm for carrier suppression Af/f = 2.405, and (c) IF power of —2 dBm 
for first sideband suppression, Af/f = 3.832. 
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Fig. 6 — Circuit block diagram for the error rate measurement. 


first output is ideally a replica of the baseband drive signal from the 
random word generator. The second output provides a timing signal 
for the regenerator, which samples the polarity of the first input at 
the time of arrival of the second input. The regenerator output is 
then compared with the random word generator output in an error 
counting circuit. (Reference 12 gives further details of circuit com- 
ponents. ) 

For the random-word modulation just described, the drive required 
at the 160 megabit rate was about 0.1 volt peak-to-peak (to a 750 
load) for the varactor-tuned deviation scheme, and 0.03 volt peak-to- 
peak for the current-tuned deviation scheme at the optimum error-rate. 
Figure 7 compares typical eye-diagrams for the two deviation meth- 
ods tested. For the latter case, the measured error-rate was 107 for a 
16 dB signal-to-noise ratio, which was 2 dB worse than that obtained 
with a 1.3 GHz tunnel-diode deviator.? The major problem appeared 
to be the FM noise on the IMPATT oscillator output when it was 
optimally tuned for best deviation. The noise could have been re- 
duced by increasing the circuit Q and consequently increasing the 
IF drive power. 

Error rate improvement could have been obtained by matching 
the input impedance to the IMPATT bias circuit over the band- 
width of the baseband signal. A narrow band matching results in dis- 
tortion of the baseband pulses. 
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_ Fig. 7 — “Thye-diagrams” of the differentially-coherent-detected random word 
signal. (a) With the varactor-tuned FM deviator. (b) With the current-tuned 
FM deviator. 


V. MEASUREMENTS OF SUBHARMONIC FREQUENCIES 


As mentioned in Section I, the oscillator circuit under study pro- 
vides at least two resonant circuits for the IMPATT diode, namely, 
an output circuit which is resonant at the transit-time frequency in 
M-band (50 to 75 GHz) and an “idler” circuit which is resonant at 
one-half the transit-time frequency in V-band (26.5 to 40 GHz). 
The two-frequency arrangement, with the frequencies harmonically 
related, has improved the tuning sensitivity of the oscillators, as we 
have shown in previous sections. 

Although the existence of such a subharmonic oscillation can be 
deduced from the experiments described below, it is not surprising in 
view of the highly nonlinear nature of the impact ionization and 
avalanche process. Indeed, small-signal theories have predicted that 
the negative resistance exists over, at least, an octave in frequency.??-* 
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Parametric oscillations and mixing have been observed by DeLoach, 
Johnston, Evans, and Haddad.* 1" High-efficiency, low frequency, 
subtransit time oscillations have also been reported by Prager, John- 
ston, Scharfetter, and others,1*"9 and analyzed recently by Scharfet- 
ter and others.?° In Scharfetter’s analysis, the low-frequency oscilla- 
tion is not necessarily harmonically related to the fundamental tran- 
sit-time frequency. In recent work, Swan*® showed that improvements 
in both output power and tuning bandwidth are obtained by pro- 
viding an idler circuit which resonates at the second harmonic of the 
transit-time frequency. This result is similar to our findings except 
that in our case the idler is at one-half the transit-time frequency. 


5.1 Measuremenis 


Since the idler frequency is below the cutoff frequency of the RG- 
98/U waveguide, it is impossible to detect its existence directly at 
the output port of the oscillator as shown in Fig. 1. However, the 
existence of such a signal at the subharmonic frequency would result 
in mixing with the fundamental to produce an output at 3/2 the fun- 
damental frequency. It was found that the next higher-order harmonic 
detected was always 3/2 of the oscillation frequency independent of 
bias current for all the diodes tested. 

For direct detection of the below-cutoff subharmonic oscillation 
dielectric-filled waveguide tapers which had a cutoff frequency of 18 
GHz were used as shown in Fig. 8a. However, a short section of air- 
filled RG-98/U waveguide with about 30 dB attenuation at 30 GHz 
remained between the IMPATT diode and the output waveguide 
taper. This arrangement retained the same subharmonic oscillation 
circuit conditions. Yet, if enough power exists for the signal at the 
subharmonic frequency to pass through the short section of air-filled 
RG-98/U waveguide, both the fundamental and the subharmonic fre- 
quencies should be present at the output port. A mixer and a spectrum 
analyzer were used to detect the subharmonic, while a wavemeter and 
a diode detector were used for the fundamental frequency. There was 
indeed an appreciable amount of power (estimated at 9 dBm at the 
diode) at the subharmonic of 27 GHz; the fundamental was exactly 
twice the subharmonic frequency, or 54 GHz, within measurement 
error. 

When the short section of the RG-98/U waveguide preceding the 
diode wafer was also filled with dielectric the power output at the 
subharmonic was increased appreciably, but the frequency shifted 
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Fig. 8— Waveguide arrangements for measuring the subharmonic signal. 


slightly because of the change in loading impedance. The measure- 
ments of the subharmonic power on several diodes are summarized in 
Table III. 

When the sliding short on one side of the oscillator mount was 
replaced by an E-H tuner followed by a wavemeter and a detector, 
as shown in Fig. 8b, both the fundamental and the subharmonic 
could be measured simultaneously. The results again confirmed that 
the subharmonic frequency was exactly one-half of the fundamental as 
shown in Fig. 9 for diode No. 3. This harmonic relation held true for 
any bias current. 

The E-H tuner in the M-band circuit serves as an impedance 
matching device. By mismatching the oscillator to the load in the M- 
band circuit (thus reducing the 58 GHz power delivered to the load), 
the power delivered to the V-band load increases, and vice versa. Thus 
for efficient operation of IMPATT diodes, the subharmonic (and the 
harmonics) * should be reactively terminated. Likewise maximum sub- 
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TaBLE IJJI—MEASUREMENT OF POWER AT ONE-HALF 
THE TRANSIT-TIME FREQUENCY 


Bias current Frequency Power output 

Diode No.* (mA) (GHz) (dBm) 

1 115 28.138 12.05 

2 129 28.36 11.2 

3 140 29.05 12.0 

5 140 29.51 10.2 

6 150 34.48 16.9 

7 150 32.25 15.2 

8 150 32.98 14.7 

9 150 31.91 11.1 


* The diode number is consistent with that in Table II. 


harmonic power can be obtained when the fundamental and all 
harmonics are reactively terminated. Using this approach, by reac- 
tively terminating both the fundamental and the subharmonic, we 
obtain an output power at 3/2 the transit-time frequency of about 3.3 
dBm at 86 GHz. 


5.2 Harmonic Phase-Locking 


The Fig. 8b circuit arrangement also was used for harmonic phase- 
locking. The locking-signal was injected in the V-band end through 
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Fig. 9— Fundamental and subharmonic frequencies as a function of diode 
bias current. 
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a 10 dB directional coupler. Both the locked signals at the V-band 
and at the M-band were measured. The gain-bandwidth product 
(2Af/fo) (Po/P:)* was about 0.05 to 0.07 for the V-band output.* 
The M-band output frequency was simultaneously phase-locked with 
the locked-bandwidth exactly twice that of the V-band. Since the 
power output in the M-band was about 1 mW compared with about 
6 mW at V-band, the apparent gain bandwidth product was much less. 


5.3 V-Band Circuit with Cap Structure 

The experiments of Section 5.1 were conducted on IMPATT diodes 
in the iris circuit shown in Fig. 1. The same behavior was observed in 
a different circuit structure. A V-band oscillator was constructed using 
a resonant cap structure similar to the circuit described in Ref. 1. 
(see Fig. 10.) Caps were made in various diameters, and could be 
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Fig. 10— A V-band IMPATT oscillator using cap resonator structure. 


slid up and down the center bias rod, thus permitting the frequency 
of oscillation to be varied. A diode with very similar characteristics 
(breakdown voltage and capacitance) as the ones used before was 
selected from the same batch (LO 1114). The fundamental oscilla- 
tion was in the range of 64 to 72 GHz with an output power of 10 
dBm at 67 GHz for 150 mA bias. The subharmonic power was also 
detected. 

To optimize the fundamental power, a V-band to M-band taper 
was used in addition to an E-H tuner so that the subharmonic oscil- 
lation was reactively terminated. Table IV shows the results. Notice 
that when the diode was biased at the same current level, the sub- 
harmonic frequency was close to 14 the fundamental frequency. The 

* Here we define: jf) = free running oscillation frequency 

P, = free running oscillation output power 


' _P; = injected power at frequency fo + Af 
2Af = bandwidth over which the oscillator is pliasectogked 
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TABLE [V—OSCcILLATION IN V-BAND CIRCUIT 
witH Cap-RESONATOR STRUCTURE 


Subharmonic Fundamental 
(Subharmonic reactively terminated) 

Bias Frequency Power output Bias a Frequency Power output 
(mA) (GHz) (dBm) (mA) (GHz) (dBm 

80 382.75 7.5 80 64.95 4.0 
100 32.87 9.5 105 65.7 5.7 
150 33.4 14.0 150 67.0 10.5 
160 36.0 14.0 160 72.0 11.1 


slight difference resulted from the reactive termination of the sub- 
harmonic which required slight retuning for maximum output power 
at the fundamental frequency. 


5.4 Comparison of the Outputs 

Table V compares the fundamental and subharmonic power out- 
puts in the two different circuits. For the iris-wafer circuit the diode 
was mounted in the RG-98/U waveguide; dielectric-filled waveguide 
and tapers were used when the subharmonic frequency power was 
measured. The dielectric-filled waveguide and tapers had an insertion 
loss of 1 dB in the frequency range of interest, which was taken into 
account for the power listed in the fourth column in Table V. For 
the resonant cap circuit, the diode was mounted in the V-band wave- 


TABLE V—CoMPARISON OF FUNDAMENTAL AND 
SUBHARMONIC PowER OUTPUTS 


Subharmonic Fundamental 
Diode Bias Frequency Power Frequency Power 
No. (mA) (GHz) (dBm) (GHz) (dBm) 


Tris circuit 


1 115 28.13 12.05 55.02 9.5 
3 140 29.05 12.0 57.84 9.9 
5 140 29.51 10.2 59.5 7.3 
6 150 34.48 16.9 69.1 11.5 
7 150 32.25 15.2 63.93 ule ae 
8 150 32.98 14.7 66.25 10.6 
9 150 31.91 11.1 63.7 7.6 


Resonant Cap 
Circuit 
4 


10 160 36.0 14 72.2 11.1 
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guide as previously described, so that no dielectric-loading was neces- 
sary. 

In both circuits, the subharmonic power was maximized by re- 
actively terminating the fundamental power, and vice versa. The 
subharmonic power is generally greater than that at the fundamental 
output by about 3 to 6 dB. This is in general agreement with Johnston, 
Schafetter, and others.1°*° However, the current density used here is 
the same as for the diodes operated in the fundamental frequency 
mode, and the frequencies are harmonically related to each other. 


VI. CONCLUSIONS 


By using an idler resonance at the subharmonic frequency, one can 
design oscillators with 20 percent tuning range. Such tunable oscil- 
lators and frequency deviators were built and worked satisfactorily in 
an experimental millimeter-wave PCM repeater system. 

Multiple frequency circuits for IMPATT diode oscillators offer an 
important means of generating millimeter waves with useful power. 
For example, 2 mW of power at 3/2 the transit-time frequency, or 
86 GHz, has been obtained. 
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Minimal Synthesis of Two-Variable 


Reactance Matrices* 


By T. N. RAO 
(Manuscript received July 3, 1968) 


A simple algebraic method stemming from ideas in minimal state-variable 
realization theory ts developed for the synthesis of two-variable reactance 
matrices. The method rests mainly on the factorization of a one variable 
polynomial matrix which is para-Hermitian and positive semidefinite on 
the wmaginary axis, and always yields a realization minimal in both vari- 
ables. 


I. INTRODUCTION 


Two-variable reactance functions and matrices, originally intro- 
duced to represent the characteristics of lumped passive networks 
with variable elements,’ ? have become more important because of 
their application to the synthesis of lumped-distributed networks. 
Ansell first showed the two-variable reactance property of networks 
composed of lossless transmission lines and lumped reactances.? The 
two-variable theory has also been applied to the synthesis of net- 
works consisting of lumped resistors capacitors and uniformly dis- 
tributed RC lines,* > which are of importance in microelectronic struc- 
tures.” 7 Besides the various applications, the two-variable reactance 
theory is of theoretical interest in itself since it can be shown that 
passive RLC synthesis is a special case of two-variable reactance 
synthesis.” 

Koga® demonstrated that every nxn two-variable reactance matrix 
W (p, s) can be realized as the impedance seen at the first n ports of 
a lossless (n+ qr)-port network in the p-plane terminated at its last 
qr ports with unit inductors in the s-plane; q is the rank of W(p, s), 
and r is the highest degree of s in the least common denominator of 


* This work is based on Chapter ITI of the author’s dissertation, “Synthesis of 
Lumped-Distributed RC Networks” submitted in partial fulfilment of the Te- 
quirements for the Ph.D. degree at Stanford University, May 1967. 
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the elements of w. The method is quite complicated and rests heavily 
on the theory of algebraic functions and the structure of para unitary 
matrices. Also it does not guarantee the use of a minimum number of 
elements. Youla® solved the problem of synthesizing a lossless two- 
variable scattering matrix by adapting an earlier method for syn- 
thesizing one-variable scattering matrices.t° The method could be 
adapted to the direct synthesis of an impedance matrix but appears 
to be unduly complicated because of the need to find the transforma- 
tion required to transform a generally unrealizable coupling network 
into a realizable one. 

A simple algebraic method stemming from ideas in minimal state- 
variable realization theory and having similar beginnings as that of 
Youla® is developed here for the synthesis of two-variable reactance 
matrices. The method rests mainly on the factorization of a one- 
variable polynominal matrix which is para Hermitian* and positive 
semidefinite on the imaginary axis. Such a factorization is well known 
in n-port network theory and once it is accomplished, the coupling 
network is obtained by simple matrix operations. Furthermore the 
method always yields a network minimal in both types of elements. 

We first introduce some basic definitions and necessary theorems, 
and later we add more as the need arises. The synthesis procedure is 
developed in Section III. Since the various proofs involved are rather 
indirect and tend to cloud the simplicity of the actual procedure, the 
synthesis procedure is outlined in Section IV. The reader interested 
only in the procedure and not in the theory behind it may go directly 
to Section IV where step-by-step instructions are given for the syn- 
thesis of any two variable reactance matrix. In Section V an example 
is worked out. The notation used in this paper is almost the same as 
found in earlier work to assure easy reading for those familiar with it.° 
Capital letters indicate matrices; bold face letters indicate matrix 
transposition. A superscript dagger indicates the substitution of —s 
or —p for s and 7 respectively, in the case of two-variable functions. 


If. BASIC DEFINITIONS AND THEOREMS 


The basic notion in the two variable theory is that of a two variable 
positive real matrix, which is a straightforward extension of the same 
notion in the one variable theory (See p. 96 of Ref. 11 and p. 32 of 
Ref. 8). 


* A matrix A(p) is said to be para-Hermitian if A(p) = At(p) where the bold 
face letter denotes matrix transposition and the superscript dagger denotes re- 
placement of p by —p. 
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Definition 1: An n X n matrix W(p, s) is said to be a two variable 
positive real matrix if 


(zt) W is real for real p and s. 
(iz) W is analytic in the domain Re p > Oand Res > 0. 
(iz) W + W* is positive semidefinite in the domain Re p > 0 and 
Res > 0. 


By statements such as: “W is analytic” in the definition and in 
what follows, we mean, “each element of W is analytic.” A two vari- 
able function is said to be analytic at a point if it has a total differ- 
ential at the point. The bold face letter indicates matrix transposition, 
and the superscript star indicates the complex conjugation of each 
element. 

If W(p, s) satisfies conditions (ii) and (iii) of the definition and not 
necessarily condition (i), it will be called a two variable positive 
matrix. 


Definition 2: Ann X n matrix W(p, s) is said to be a two variable 
reactance matrix if 


(t) W is a two variable positive real matrix. 

(i) W+WwWt=0. 

The superscript dagger indicates the operation of substituting —p 
and —s for p and s in the original matrix. This definition of a two 
variable reactance matrix is similar to the corresponding one in the 
one variable theory. (See p. 102 of Ref. 11 and p. 32 of Ref. 8.) Anal- 
ogously, as in the one variable case (p. 117 of Ref. 11), it is generally 
hard to check if condition (72) of Definition 1, which involves the 
whole domain Re p > 0 and Re s > 0, is satisfied for a given two 
variable matrix; we would like to find an equivalent set of conditions 
that are easier to check. In the case of two variable reactance matrices, 
the following theorem proved by Ozaki and Kasami? in the scalar 
case, and extended to nonsymmetric matrices by Koga, (p. 33 of Ref. 
8) serves this purpose. 


Theorem 1: The necessary and sufficient conditions for ann X n matrix 
W(p, s) to be a two variable reactance matrix are: 


(t) W ts rational in p and s, and real for real p and s. 
(it) W is analytic in the domain Re p > 0; Res > 0. 
(iz) W = —Wi. 
(wv) For any (po, So) with Re po = Re s = 0, which ts a regular 
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point of W, poles of W(po, s) and W(p, so) are simple and restricted to 
the imaginary s and p axes respectively. 

(v) OW/dp and dW/ds are positive semidefinite Hermitian for Re p = 
Res = 0, except at poles. 

The proof of this theorem can be found on p. 33 of Ref. 8. We will 
interpret the above conditions on a physical basis. Assuming that a 
network realization consisting of reactances in the p and s-planes 
exists for W(p, s), condition 7 is fairly obvious, since the general loop 
impedance will be a real rational function in p and s. Condition 7 is 
also an obvious consequence of this reason, since the substitution of 
—p and —s in W(p, s) is equivalent to changing the sign of all ele- 
ment values and hence of every branch and loop impedance. Under 
the assumption of existence of a two element kind of reactance net- 
work corresponding to the given W(p, s), condition ww is also clear, 
because p is fixed as a pure imaginary number, the p-type elements 
can be considered as “frequency insensitive reactances,” and their 
presence in a network consisting of pure reactances in the s-plane can- 
not create poles off the imaginary s axis. Similar reasoning justifies 
condition v for s fixed at any imaginary number, the positive semi- 
definiteness of dW/dp can be considered as an extension of the posi- 
tive slope of a reactance function in the one variable theory. 

The necessary and sufficient conditions for a two variable reactance 
function are not discussed separately, since scalars can be considered 
as a special case of a reactance matrix. 

If W(p, s) has a pole at p = po, independent of the value of s, 
is said to be an s-independent pole of W. The following theorem (see 
p. 34 of Ref. 8) concerning such poles is important for the synthesis 
method to be given. __ 

Theorem 2: A two variable reactance matrix Wo(p, s) can be de- 
composed as 


Wo(p, s) = Wilp) + W2(s) + WO, s) 


where W, and W, are reactance matrices in p and s, respectively, and W 
ts a two variable reactance matrix with no p-independent or s-independent 
poles. 

Any given two variable reactance matrix W,(p, s), by virtue of the 
above theorem, can be realized as a series connection of networks hav- 
ing W,, W2, and W as their impedance matrices, as shown in Fig. 1. 
Since W, and Wz can be realized by existing techniques (See chapter 
7 of Ref. 11) the given W, can be realized if a method of synthesis is 
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Wo (P,S) —> N-PORTS 


W(p, Ss) —»> O 
N- PORTS 


Fig. 1— Interpretation of theorem 2. 


found for W. Henceforth we assume that the given reactance matrix 
has no p-independent or s-independent poles. 


III. SYNTHESIS OF TWO VARIABLE REACTANCE MATRICES 


Let us assume that there is a passive n-port network representation, 
consisting of p- and s-type reactances, gyrators, and ideal transform- 
ers, for a given two variable n X n reactance matrix W(p, s). In such 
a network it is always possible to replace each s-type capacitor by a 
gyrator-s-type inductor combination and then isolate all the s-type 
inductors, of which we assume there are k, as shown in Fig. 2, without 
changing the impedance seen at the prescribed ports. If we further 
assume that the (n + k)-port coupling network, consisting of p-type 
reactances, ideal transformers, and gyrators has a Z matrix, then the 
impedance matrix W (p, s) seen at the first n ports is given by 


W(p, 8) = 2p) — 212(p)[Z22(p) + sli] 221 (p) (1) 
where Z(p), the impedance matrix of the coupling network is given by 
Up) = ro | (2) 

Z21(p) 220(p) 


Since the coupling network is a lossless network in the p-plane 


Z= —Zt (3) 
and we have 
W(p, 8) = 21:(p) + Z12(p)[ze2(p) + sli] "Z12(p). (4) 


Next we show, by algebraic means, that every two variable reactance 
matrix can be decomposed into the form in equation (4), such that 
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n k 
ee Zy(P) Za(p)|n 
Zai(P) Zee(p) | k 


Fig. 2— Extraction of S-type inductors. 
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N-PORTS —> 
W(p,S) —> 


Z(p) of equation (2) describes a lossless network. Once such a decom- 
position is found, we can realize the given W(p, s) by realizing Z(p) 
by any of the existing techniques (see chapter 7 of Ref. 11) and 
terminating it at its last k ports with unit inductors in the s-plane. 

To establish that any given two variable reactance matrix W(p, s) 
can be decomposed as shown in equation (4), we first expand W(p, s) 
and the expression on the right side of equation (4) about s = o and 
find the expressions that relate 211, 212, and Ze. with the expansion 
coefficients of W(p, s). We then show that a set 211, 212, and Z22, which 
satisfies the above relations and at the same time guarantees that the 
Z(p) of equation (2) is a reactance matrix in p, can always be found. 

The given two variable reactance matrix W(p, s) can be assumed 
to have no p-independent or s-independent poles by virtue of Theorem 
2 and hence can be written in the form 


Bo(p)s’ + Bi(p)s’* + +++ + B,(p) 

ao(p)s’ + ay(p)s* + +++ + a,(p) 

where the B;(p) are real polynomial matrices in p and the scalar 
g(p, 8) = ao(p)s’ + ay(p)s™* + +++ + a,(p) (6) 


is the least common denominator of the entries in W(p, s). For any 
ordinary value of p, W(p, s) can be expanded in the neighborhood of 
S = 0 as? 


W(p, 8) = (5) 





Wo, 8) = Ag) + Ae. (7) 


l=0 § 
Expanding the right side of equation (4) in the neighborhood of s = 
00 1 _t 
zu(p) + 2ule)lealp) + slid 20) = en + (—D! DB. 
=0 


For the equality in equation (4) to hold, we identify 
Zi(p) = A_,(p) = W(p, &) (9) 
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and 
Ap) = (—l)'eueieti2 «= 1 = 0, 1,2, -°-. (10) 
Since the Z(p) formed out of 231, 212 and 229 
Z(p) = ss, * (11) 
t . 
—Zy2 222 
has to describe a lossless network in the p plane, we must have 
Z = —Zt 
as given by equation (3), and hence 
ay = —Zh (12) 
and 
222 = — 230 : (13) 


With the identification in equation (9), equation (12) is always satis- 
fied, since by equation (9) 


4 = W (p, «) = —W(=?p, —), 


and thus 23, is uniquely determined. The problem is to chose a pair 
212, 222 to satisfy equation (10) and at the same time guarantee that 
equation (11) describes a lossless network in the p-plane. For Z(p) to 
describe a lossless network, it must be positive real and satisfy equa- 
tion (3). 

Before proceeding further, we would like to know more about A;(p), 
the expansion coefficients in equation (7). By equating the right sides 
of equations (5) and (7), 


By(p)s’ + B,(p)s = a nae aie B,(p) 
= [ag(p)s" + ape"? + +++ + a,()]| As cD A) |. (14) 


i=0 8 





Equating coefficients of like powers of s on both sides of equation (14), 
(see p. 207 of Ref. 12, Vol. IL). 


ax(p)A-1() = Bo(@) 
a,(p)A-(p) + ao(p) Aol) = Bx) 
ax(p)A-s(p) + a4(p)Ao(p) + ao(p) Ail) = Bol) (as) 
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a,(p)A-s(p) + a,-1(p)Ao(p) + ++ + ao(p)A,-1(p) = B,(p) 
and 
A(p)A(p) + a(p)Ai-i(p) + --- + .4,(p)A:-.(p) = 0, for 7 2 r. 


From equation (15) an expression for A;,(p) can be written* in the con- 
venient form (see p. 14 Ref. 9) 


Bop) — ao(p) 0 0 eee 
B, (p) ay (p) Ao (p) 0 0 
A,(p) = a Bao) ax{P) a(P) aa(P) - . (16) 
Bip) ap) a-ilp) ai-e(p) +++ ao(p) 
Bisilp) Qiar(p) ap) a-i(p) +++ ap) 
i= —1,0,1,2,-:- 


B; =0, for l>r 
a, = 0 for 1>r 


where the (1 + 2) x (lJ + 2) determinant is expanded formally in 
terms of its first column. In equation (16) the B; are matrices, the a, 
are scalars, and the determinant is not a determinant in the usual 
sense. From equation (16), it can be seen that A;(p) is of the form 


real polynomial matrix in 
Ai(p) = Se agg (17) 


Another important property of the A,(p)’s is obtained from the rela- 
tion 


W(p, s) = —W(—p, —8) 


which implies 


A.) + DABP = -a(-») — Dnt ASP. as) 


1=0 


Hence by equating like powers of s 


*Alternate methods of obtaining these A i(p)’s are by diffe rentiation of W(p, s) 


eoipy ,s 
Ai(p) = ges) 


or by long division. 
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A; = (—1)'Al. (19) 


If, for the purpose of choosing a pair 232, Zee that satisfies equation 
(10) and, at the same time, guarantees that the Z(p) of equation (11) 
describes a lossless network in the p-plane, we define P;(p) as 


[ #12 
| 
) Zy2%20 
Zz 2 
_ | @12222 
Pi(p) = ||. (20) 
l 
212222 
Then 
Tt tT 7 at _ aft it_t 
212Z12 212Z20Z12 Z1eZ20Z,2 °° *  By2Zo0Zi0 
T t Tt ot ft wt _t 
212%99Z 12 2 12%29Zo0Z12 «= 2%20Ze0Z12 °° * Sy2%22Z22Z12 
2 t 2 Tt Tt 2 _2t t 2 it _t 
t _ | 212%22Z12  Z12%22Z20Z12 «212% 22Z20Zo0 °° * 2 2%22Z00Z 12 
PRs (21) 


ut _t ttt 1 _ot_t l _it_t 
212%92Z12 212%29Z22Z12 2 12%22Z20Zo2 °° * 212%22Z01Z12 
e ° 5 ° < - ‘+ t 
In the above matrix, the entry in the 7th row and jth column is 22235235219 
and by equation (13) 
. st T . i+ ° t 
Zre@a2Zo0Z12 == (—1)’ey2%02 ‘Zio - (22) 


Since we wish the equality in equation (10) to hold 


PE A = (—1) 2.2252? Z19 ={> 1)‘ Ajs; . (23) 
If we define T;(p) as 
A,(p) A,(p) A,(p) oe A,(p) 
— A,(p) — A,(p) — A;(p) ook = oe AED) 
Tp) = A,(p) As(p) A.D) 7 : ArsalP) 


(—1)'Ai(p) (—1)'Ai41(p) (—1)'Ai42(p) ares (—1)'Ani(p) 
(24) 


from equation (23), we can see that 


TP iP, ; s: wheas [B5) 
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Equation (25) suggests that a way of obtaining a pair 212, 222 would 
be to form the matrix 7T;(p), factor it in the form of equation (25), 
and then try to identify 21. and Zo. from these factors. We do not know 
in advance if the matrix 7;(p) formed from the expansion coefficients 
of W about s = o can always be factored as indicated in equation 
(25) ; hence we first study the properties of T,(p), to see if it can be 
factored in the desired form. 

Consider the matrix T,(p) when | = r, r being the s-degree of g(p, s), 
as given in equation (6), 


Ao A; Ag Lens Ari Ar 
—A, —A, =a Ae Pian ie 
A A wisps 7 is 
T= -" 7 — je vey (26) 


(=the (<A, (<1) gg (HD) (HD) 
LDDs (HDA Cel) Ag CA C1) Aer 
The matrix obtained by deleting the last column and row in equation 
(26) is T,1, and by equation (15) it is easy to see that the last column 
is a linear combination of the first r columns. Hence* 


rank J, = rank T,_, 
and 


rank T, = rank T,_, for l2r-—1. 


The rank of T,_; is connected with the s-degree 8,[W(p, s)] of W(p, s) 
which is defined in Definition 3 (see p. 10 of Ref. 9). 


Definition 3: The s-degree of a rational two variable matrix W(p, s) 
is obtained from the rule 


s = degree of W(p,s) = 6,[W(p,s)] = max 6[W(p , s)] 


where 8[W(p., s)] is the McMillan degree (see part II of Ref. 13) of 
W (po, 8). For any fixed p,, W(p., s) is a matrix of rational functions 
in s with its McMillan degree uniquely specified; hence the above 
definition uniquely specifies the s-degree of W(p, s). The relationship 
between the s-degree of W(p, s) and the rank of T,_, is stated formally 
in the following lemma. 


Lemma 1: The rank of T,-,(p) ts equal to the s-degree of W(p, s). 


* By the rank of rational or polynomial matrix we mean the “normal rank,” 
which is defined to be the rank everywhere ii at a finite number of values 
of the variable. 
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The proof of this lemma for the one variable case can be found in 
Ref. 14 and on p. 200 of Ref. 10, and for the two variable case on p. 17 
of Ref. 9. 


To show that the matrix T,_,(p) can always be factored in the form 
of equation (25), we need the following lemma. 
Lemma 2: The matrix T,-,(p) defined by equation (24) forl = r — 1 
satisfies 
@) T,1 = Tr, 
(at) T,-\(jw) is Hermitian and positive semidefinite. 


Proof: 


Since 4, = (—1)'A} by equation (19), the proof of 7 is readily seen 
from equation (26) 


| ae lene (27) 


To prove (72), we first notice that by Theorem 1, for any real w, W (ja, s) 
has only simple poles, which are restricted to the imaginary axis in 
the s-plane. Hence W (jw, s) can be expressed in the partial fraction form 


WGe, 3) = AaGs) + Ae (28) 
where, f;(jw) are the residue matrices at the poles ja,;(w), and the a;(w) 
are real. 
It is shown in Appendix A that the R;(jw) are Hermitian and positive 
semidefinite for each w. Now, if each term in the sum on the right side 
of equation (28) is expanded about s = ~, we have 





We, 8) = AaGe) + OY VY Rae. (29) 


For the purpose of comparison, equation (7), written with p = jw, is 


WG dag ges = (30) 


a=o0 § 





The right sides of equation (29) and (80) are expansions of W(jw, s) 
about s = «©, and because of the uniqueness of a power series expansion 


AdGa) = ¥ Ga) RlGs). (31) 


By noting that the a; are real and the R;(jw) are Hermitian and positive 
semidefinite for each w, we have 
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Agta) = Do RalGa) > 0 (32)* 
AiGs) = 5 D aati) (38) 
Aaja) = — DV ofR(js) 50 (34) 
Aum-s(is) = — 3D of" *RiGa) (35a) 
Asn-o(is) = — Dol" Riis) 5 0 (35) 
Asm-iGis) = —§ D of" Ria) (350) 
Aunlis) = Dol"R.Gs) BO. (354) 


By direct substitution of equation (33) into equation (24), T,-1(jw) 
can be written as 


R; ja:R; —aiR; ae (ja,)'"'R, 

—jaR; oR; joi ; es —(ja;)’R; 
. . _ PR; —j AR; : i athe j R: 36 
Ta) = S| ia aR Gay"'R, |), (86) 


(—1)"Ga)" "RR: (1) "Ga)"Re (— 1) Ga) Re ++ (1) Ga)? 


The matrix sum on the right side of equation (86) can be written 


R, 0 ae 
0 0 0---0 

pie oe Oe (37) 
E 0 0---0 


where 


* By the notation A 2 0 or A S 0, we mean that the associated Hermitian form 
of A is positive semidefinite or negative semidefinite. 
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te 0 0 -O 7" 
jail, 1, 0 ---0 
0 1 


L; = ail, ge SSO elt og (38) 


CAD Qa) de Os Oe eee ad, 


Since each R;(jw) is Hermitian and positive semidefinite for each a, 
the sum on the right side of equation (37) is also Hermitian and positive 
semidefinite. Hence, we have proved the lemma. 

We have shown that 7,_,, a matrix of rational functions, is para- 
Hermitian and positive semidefinite on the imaginary axis. Such a 
matrix can always be factored in the form shown in equation (25), 
(see p. 133 of Ref. 15). It is tempting to factor 7,_, at this stage and 
find 2,2, 222 to satisfy the required conditions, but we will factor aj” T,-, 
instead of T,_, for the reason that the factors would be polynomial 
matrices. 

From equation (17) we can see that a” T,_, is a polynomial matrix 
in p. To be able to factor aj” 7,-, in the required fashion, we have to 
show that 7 = a?’ T,_, is para Hermitian and positive semidefinite on 
the jw axis. To do this, we obtain the required additional information 
about the polynomial a,(p) from the following theorem. Since the 
theorem contains more information than we need at this point, we will 
only state it here; a proof is given in Appendix B. 


Theorem 8: If 


_ Bo(p)s” + Bi(p)s"™* + ++: + Be) 








W(p,s) = : os 
9) = aps’ + alps + ap) 
is a two variable reactance matrix, then for allz = 0,1, ---,7r 
B, : eo 
(2) i is a reactance matrix in p 
(it) is a reactance function in p 
t+1 
(212) a; has all its zeros on the jw axis and these are simple 


(iv) XB;X forall constant real X 1 vectors, X, is a reactance 
XB;,,X function in p 
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From this theorem, a,(p) can be represented as 
ap) = p’ I] (* + ai) = +a(—p) (39) 
where vy = 0 or 1. Hence 
PS as Bs (40) 


From the form of a, shown in equation (39) and Lemma 1, it can be 
seen that 


T,s(jo) = 0 (41) 
except when simultaneously, »y = 1 and r is odd; in which case 
Tes 1 (jw) = 0. (42) 


We will assume that 7,_,(jw) = Oin developing the synthesis procedure 
and discuss the needed modification when 7,_,(jw) < 0 later. 

If the s-degree of W(p, s) is equal to k, by Lemma | the rank of T,_1(p) 
and hence of 7,_,(p) is k. Since 7,_, = T!_, and 7,_,(jw) = 0 there 
exists a factorization *°'*” 


T,-1(p) = M(p)M*(p) (43) 


where M(p) isan nr X k polynomial matrix and has a left inverse M~*(p) 
which is analytic in Re p > 0. 
From the definition of 7’,.,, we have 


, 
T,(q) = SEE). (44) 
M (p) can be partitioned into n X k blocks M;(p) 
Mal) 
M(p) 
Mp) =| 77" (45) 
M,.(9) 
and hence 
M'(p) = [Mo(p) ! My(p): «++ | Mi-s(p)]. (46) 


_ Now by comparison of equation (45) with equation (20), we can im- 
mediately identify a suitable z,, as 





42 > ay ° (47) 
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To find a suitable 222, if we define T, as 








[ —Ay =A, —A,; ++: —A, 
A, A; Ag Ta eS Ayat 
=A A eZ. es A as 
Tip) =| -* ss (48) 
pr, (—1)"Aray (—1)"Ayss See (= 1) Ase 
from equations (20), (21), and (25) we see that 22. must satisfy 
1 t 
or MeZ2.2M. — TT; “ (49) 
Qo 


Even though equation (49) does not uniquely specify 222, we can choose 
for 209 


Zon = MOT MO. (50) 

From equation (19) and the definition of T,, we see that T; = at bl 
and hence 

222 = —Zyp. (51) 


We now notice that by construction, the pair 2:2, 222. defined by 
equations (47) and (50) satisfies 


(1) 212230212 = A; (10) 


for all0 <1 S 2r — 2. Our aim is to find 2;, and 2s. that satisfy equation 
(10) for all! = 0. It is not immediately clear that the pair 212, 222 defined 
by equations (47) and (50) satisfy equation (10) for alll = 0. 

To see that the chosen pair 2,2, 222 does indeed satisfy equation (10) 
for alll = 0 and not just for0 S$ 1 S 2r — 2, we introduce the generalized 
companion matrix Q(p) defined by’? 


0, 1, 0,, bas starts 0, 
0, 0, 1, weer was 0, 
ad 
0, 0, 0, sm ~ Oy im 
_ a R _ Gr-1 hi _ ar-2 1, _ a2 1 Oh 1, 


Qo Ao Ao a." Qo 
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From equation (15) it can be seen that 
eS S238: (53) 
Hence, by equation (48) 
—a? MT, ,QM™' 
—M"MM'eM’ 
—M'omM", 


222 


and by equation (51) 

Zoo = —M'QM™' = M0'M. (54) 
Hence 
—M-o'WM' em" 
—o2’M09'T,_, QM" 
—o@'M0?'T,_,.M7" 
—M-'9?'u 


2 
£02 


and 
ao = (-D' Mm 0"'M; 1>0. (55) 


From the definition of 2, we see that g(p, £) is its minimal polynomial, 
and hence the matrix polynomial 


g(p, Q) = a2" So a,Q"~* ar os ee =f Glas 


Il 


Ons (56) 
and hence 
g(—p, W) = One. (57) 
By equation (55) 
g(p, 222) = Mo* (—1)'*a,Q"t + (—1)"7a,Q"""t + +--+ +.4,1,, M, 
and by equation (57) and Theorem 3 
9(D, %22) = + M'[g(—p, M)\M = O,. (58) 


From the last equation in equation (15), from equation (58), and from 
equation (10), which holds for 0 S$ 1 S 2r — 2, 


Ay Ao,-1 = —G,Az,-2 — GgAz,-3 — +++ — G,A,-1 
Qr—-2 2Qr-2 r-15_1 
a —210[@,225 + Ge%9 °° e+? tb G,2a9 ]Z12 


2Qr—-1,_7 
—~Qp%12%22 = Zio - 


I 
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Hence 
Aga = (et ee ’ 
and by induction 
A, = (—1) 2.232212 


for alll = 0, which is the same as equation (10). 

We thus have a set of three matrices 2,,, 212, 222 such that the in- 
finite set of equations obtained by equating the right sides of equations 
(7) and (8) are satisfied. Hence the right side of equation (4) and W(p, s) 
have the same Taylor’s series expansion in the neighborhood of s = o. 
By analytic continuation, for all p and s 


W(p, 8) = 2u(p) + 212(p)[222(p) + s1,]"'z2(p), 


where 211, 212, and 2. are defined by equations (9), (47), and (50), 
respectively. 

We have thus succeeded in decomposing W(p, s) as shown in equation 
(4). It now remains to show that Z(p) formed from the chosen 2, , 
Z12 , ANd Zy0 


Zp) = 21) oe 


—Zo(p) Z20(p) (59) 


Wp, ©) ce 


as’(p)M~(p)T'a(p)M* '(p) 


_ Map) 
ay (p) 





is a reactance matrix. 

To show that the Z(p) in equation (59) is a reactance matrix, we may 
choose any standard test, but we will choose the one given below since 
it is particularly suited for the problem at hand (see pp. 117 and 123 
of Ref. 11): 


Lemma 8: The necessary and sufficient conditions for a square matrix 
Z(p) to be a reactance matrix are: 


(t) Z is rational and real for real p. 
(it) Poles of Z(p) are simple and restricted to the tmaginary axis. 
(iat) Z+ Zt = 0. 


(2v) Residue matrices are positive semidefinite Hermitian. 


Since all the entries of Z(p) in equation (59) are real and rational, con- 
dition 7 is satisfied. 
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From equations (18a) and (15), 2,, = A_, = Bo/a is a reactance 
matrix by Theorem 3; hence its poles are simple and restricted to the 
imaginary axis. Also, the pole at p = ©, if any exists, is simple for 
this block. Since M, is a polynomial matrix, it is clear that the poles of 
the off diagonal blocks z;, and —z/, are in the zeros of ap and hence by 
Theorem 3 the poles of 2:2 are restricted to the jw axis. However, it is 
not clear that these poles are simple. To show that they are indeed 
simple we will use the fact that A, defined by 





A, = aA (60) 
is a polynomial matrix. From equations (10) and (47) 
t 
Ay = = oM (61) 
Qo 


and from equation (39) a = -a,. We first consider a) = a; in which 


case 
ramen ee Me, | 
a, = | ¥s || Bs 2) 


Equation (54) then shows that 4, = A} and A,(jw) = 0. Hence there 
exists an n X q polynomial matrix, Q, such that 

where g is the rank of A,. Equations (62) and (63) are two different 
factorizations of A, , hence:”” 


M : 
ar? = Q[1,: Onx(r-a]V (64) 


0 











where V(p) isa k X k para unitary matrix, that is, VV' = 1,. Since Q 
is a polynomial matrix, and V(p) being para unitary can have no poles 
on the imaginary axis (see p. 186 of Ref. 11), the left side of equation 
(64) can have no poles on the imaginary axis. Hence aj’, which has 
all its zeros on the jw axis, must divide M,. Thus 2,2 has all its finite 
poles in the zeros of a). By Theorem 3, the zeros of do are simple and 
restricted to the jw axis. In the above, we have assumed that a) = a, ; 
if a) = —a, and r is odd, the same proof holds; if 7 is even we can con- 
struct a similar proof by factoring —A, instead of A, . 

To show that the pole of z,. at p = ©, if any, is simple. Consider 
the following representation for A» obtained from equations (15) and (17) 


aoB, — Bo _ Bi di _ Bo a 


Ay = 2 
Qo a, Ao Qo Ao 





(65) 
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Since B,/a, and By/ay are reactance matrices and a,/d) is a reactance 
function, according to Theorem 3, the right side of equation (65) be- 
haves as Kp” near p = ©, where K is a constant matrix and » is an 
integer such that —2 S$ v S 2. But 22 satisfies 


+ 
Ay = 212212 


and hence the pole of z,, at p = ©, if any, must be simple. 
We now have to show that the 2.2 block also satisfies condition (72) 
of the lemma. By equation (54) 


292 = —Mtiagmt = MoarM. (54) 


Since M~ is analytic in the open right-half plane and @ has all its poles 
in the zeros of a) , by equation (54) the poles of 222 are restricted to the 
jw axis. To show that these poles are simple we will prove by contradic- 
tion that a2.2 is polynomial. 

From equation (52), the definition of 2, and equation (54) we see that 
if GZ22 has a pole of multiplicity a at p = jw. In the neighborhood 
of this pole, we have the approximation 


K 
Ao%22 wd) hs atte (66) 
(p — joo) 
where K is a constant matrix and a is a positive integer, and 
KK? 
Oseo2 RS (67a) 
(p — Jo) 


Now by equation (55) 22, = —M~'0?tM, and hence in the neighborhood 
of p = jw 

ad, ey —H1 (67b) 

(p — jor)” 

where K, is a constant matrix and @ is a positive integer. Since the poles 
of A222 are contained in the poles of M~*, 8 S 2a. By comparison of 
equations (67a) and (67b), which must be equal, it is clear that either 
a = 6 =OorK, = K’ = 0. Since 4, = —z},, K = K*, and hence 
K’ = KK* = 0 implies that K = 0. Thus aoz22 can have no poles on 
the jw axis and this, coupled with the fact that z.. can have poles only 
on the jw axis, guarantees that ay2z2. is always polynomial. We therefore 
conclude that all the finite poles of 2.. are in the zeros of a , and their 
multiplicity cannot exceed that of the corresponding zeros of a) . Hence, 
again by Theorem 3, all the finite poles of 22. are simple ang pesucred 
to the jw axis. 
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To show that the pole at p = © of 22., if any, is simple, consider 
equation (15) written in this form: 


As 2 
Qo 

ye By a Bo (68) 
Q, Ap Ap Ao 


Ae 2s: B, _ a2 Ay ay |B ay Bo 4 Az a, Bo 
l= ual pecs | [clan A | il a 
a, Ao Ay Ao &, Ap Ao 


Owing to the reactance nature of B;/a; and a;/a;,, by Theorem 3, 
and from the form of A; shown in equation (68), near p = ©, A; behaves 
as 


A; & Kyp"' (69) 
where K; is a constant matrix and y; is an integer such that 
tp 2S ieee 2). (70) 
Also from equation (10) 
Zr@igkig = (—1)'A, & EKip”. (71) 


Since 2,. has at most a simple pole at p = o, in the neighborhood of 
p= © 


212 & Kp’ (72) 


where K is a constant matrix and / is an integer such that 1 < 1. If 
Zoo behaves as K2.p” near p = ©, where K., is a constant matrix and m 
an integer, then by equation (70), (71), and (72), (¢ + 2) = im+ 212 
—(z + 2). For such to be true for any fixed / and all integral 7 = 0, 
m has to be less than or equal to unity. Hence the pole of 2.. atp = ©, 
if any, is simple. 

We have thus shown that condition 22 of Lemma 1 is satisfied for 
each block in Z(p), and hence Z(p) also satisfies it. 


Since z,, is a reactance matrix, z,, = —Z1, and 2.2 = —Zs, by equation 
(51), we have Z = —Z? and thus condition (77) of the lemma is also 
satisfied. 


Now to complete the proof that Z(p) is a reactance matrix, we have 
to show that the residue matrices at the poles are positive semidefinite 
Hermitian. To do this we need Lemma 4, which follows from the defini- 
tions of a two variable positive real and two variable reactance matrices 
(see p. 34 of Ref. 8). 
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Lemma 4. If W(p, s) ts a two variable reactance matrix with no p-in- 
dependent or s-independent poles, W[p, s(p)] ts a reactance matrix in p 
for any reactance function s(p). 


To prove that Z(p) satisfies condition 7v of Lemma 1, which requires 
that the residue matrix of Z(p) at any of its simple poles on the jw axis 
is positive semidefinite Hermitian, we note that at any pole, p = ju, 
of Z(p), if we set 


21 
s(p) = D a for |w. |< © 





= Ip for Wp = 
in 
W(p, 8) = 211 + 212(@22 + sl.) Zio 
[which is equation (4)] then by Lemma 4, Wl[p, s(p)] is a reactance 


matrix in p for all positive 1. Since Z(p) is real for real p and Z = —2Z}, 
the residue matrix H at the pole p = jw is Hermitian; if we write it as 
He iS ay (73) 
Hy, Ho» 
then, K, the residue matrix of W[p, s(p)] at p = jwo is given by 
K = Hy, —_ HH \o(Hoe “ l1,) "Hs : (74) 


Since H, H,, , and H.. are Hermitian, there exist unitary matrices U, 
and U, such that 


Ay = U*H,,0, = diag [d;, ds, °++, d,l (75) 
and 
Ao. = U%H..U, = diag [Ay , Ao, -** , Aad. (76) 
Hence 
UtKU, = yy — Sio(Ace + U1) JA (77) 
where 
Jin = U*H,.U2. (78) 


If J,.; denotes the zth column of J,, , the right side of equation (77) can 
be rewritten as 
k 
1 
* = = ——_. . 7%. 
UiKU, = An — Dep Jad - (79) 
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Since K is the residue matrix of a reactance matrix, for alll > 0, K is 
positive semidefinite. A,, is also positive semidefinite, since H,, is 
the residue matrix of the reactance matrix 2,. Ji2:J%; is obviously 
positive semidefinite and the left side of (79) can be positive semi- 
definite for all positive / only if all the \; are nonnegative. Hence A.. 
and He. are positive semidefinite. 

To show that # is positive semidefinite, we will show that H’ defined by 


H’ = (Ut + UD)H(U, + U,) = Be a (80) 
Jiz Ago 


is positive semidefinite. For this purpose, consider the Hermitian form 


[x* X%] i. das * 
1) Det le 
= X*A.X. + X4JHX, + Xt 2X. + X*D2X. (81) 


where 
Dos = Ago + U1, ; L> 0. 
Since 
UtKU, = Ay — JuDulis 


is positive semidefinite, we obtain from equation (81) the following 
inequality: 


[x* 5] ie ‘| | 
Jiz Doe} | Xe 
= XJ pDa2 JX, + XiJiwX. + XEJhX, + KGAwX.. (82) 


Since the right side of equation (82) can be expressed as G*G, where 
G = [DiM?7*%X, + Adz?X,], the Hermitian form in equation (81) is 
positive semidefinite for all 1 > 0; by a continuity argument we can 
see that H’ and consequently H are positive semidefinite. 

We have thus shown that Z(p) does indeed describe a lossless network 
in the p-plane and thus W(p, s) has the network representation shown 
in Fig. 2. . 

In the development of the synthesis procedure we assumed that 
a2’ (jw) T,-1(jo) = T,-1(jo) = 0. If simultaneously, ao(p) is an odd 
function of p [in other words vy = 1 in equation (89)] and 7, the s-degree 
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of the least common denominator of W(p, s) is odd, then T,_,(jw) < 0 
In this case we factor —7',_,(p) which is para Hermitian and positive 
semidefinite on the jw axis. We will then have 


—T,.. = MMt (83) 


and hence, as before equation (44), 


LS 


M 
as att 





4 == 


It is then clear that the identification of z,. and 2.2. can be done in 
exactly the same way as when 7,,_,(jw) = 0. 

It is of importance to notice that the number of s-plane inductors 
used in the realization of Fig. 2 is equal to the s-degree, 8,[W(p, s) ] 
which in general is smaller than the number required in Koga’s tech- 
nique. Appendix C shows that 8,[W(p, s)] is the minimum number of 
s-plane inductors required in any realization, and that if a realization 
is minimal in the variable s it is automatically minimal in the variable 
p, the minimum number of p-type reactances needed in any realiza- 
tion being the p-degree, 8,[W (p, s) ].18 

The main result of this section can be conveniently put in ne form 
of a theorem: 


Theorem 4: Every two variable reactance matrix W(p, s) can be realized 
as the impedance seen at the first n-ports of a lossless (n -++ k)-port con- 
sisting of 6,[.W(p, s)] reactances in the p-plane, terminated at its last k 
ports with 6,[.W(p, s)] unit inductors in the s-plane. Furthermore, such a 
realization uses the minimum possible number of reactances of each kind. 
(The roles of p and s are completely interchangeable.) 


Since several of the proofs involved in establishing Theorem 4 were 
rather indirect and lengthy, while the procedure for synthesis, sum- 
marized in Section IV, is itself rather simple. 


IV. SUMMARY OF SYNTHESIS PROCEDURE 


Given an (n X n) two variable reactance matrix W,(p, s), decom- 
pose it as 


Wop, ) = W,(p) So W,(s) + W(p, 8) 


where W, and Ws, are reactance matrices in p and s, and W(p, s) is a 
two variable reactance matrix with no p-independent or s-independent 
poles. Such a decomposition is always possible by Theorem 2. 
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Expand W(p, s) as 


Ce CO pie 


tao § 





[which is the same as equation (7)] where the A(p)’s may be ob- 
tained by equations (16) or (16a) or by long division. 

Find g(p, s), the least common denominator of the entries in 
W (p, s) and express it in the form 


g(p, 8) = ao(p)s” + ai(p)s™™* + +++ + ao(p). 


[which is the same as equation (6) ]. 
Form the (nr X nr) matrix T,1(p), defined by 


T,-.(p) 
Ao(p) A,(p) Ax(p) +s A,-i(p) 
— A,(p) — A,(p) Asp) a8 —A,(p) 
A2(p) A3(p) A,(p) +s: A,+1(p) 


(—1)"A,-.(p) (—D)AL(p) (= 1) Asap) ++ (— 2) Aaa) 


which is equation (24). 
Factor 7',_,(p) = a3’ 7T,-1(p), a polynomial matrix, as 


T.-1(p) = MM (43) 


unless simultaneously, a) = —aj and r in equation (6) is odd, in which 
case factor —T._1(p). The factorization must be such that M is a 
(k X nr) polynomial matrix with k = rank of 7,-:(p) and M~, the 
left inverse of M analytic in the open right plane. The existence of such 
a factorization is guaranteed by Lemmas 1 and 2. 

Partition M(p) into (n X k) blocks of equation (45) 


Mp) 


M,-,(p) 
Form the (nr X nr) matrix 2(p) defined by equation (52) 
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With the identification of equations (9), (45), and (54)* 


Z(p) = A-_,(p), 


M Ap) 


and a2 = EQ) 


and a2 = M-(p)2'(p) M(p), 
the decomposition 


Wp, 8) = 2u() + 22()[eea(p) + slid 212 (4) 
is obtained. Notice that this is equation (4). It should also be noticed 
that W(p, s) can be decomposed as in equation (4) even if it has 
s-independent poles, since the assumption that W(p, oo) is finite is 
enough to guarantee the validity of the procedure. For network reali- 
zation it is usually more convenient to remove both p-independent 
and s-independent poles; we therefore removed them at the start of 
the procedure. 

To realize W(p, s) as the impedance of a passive network, we per- 
form the following operations. 

Form the (n+k xX n+k) impedance matrix Z(p) of the coupling 
network 


A_,(p) an 
Z(p) = oo : (84) 
M,(p) -1 t 
—4 ~~ M(p) 2 (p) M (p) 
ao(p) 
Realize Z(p) as a lossless (n+) port network in the p-plane and 
terminate its last k-ports with unit inductors in the s-plane. Also 
realize the reactance matrices W,(p) and We(s) as lossless p-plane 
and s-plane n-ports, and connect all three networks in series as shown 
in Fig. 1. The given W,(p, s) is thus realized as a passive network. 


Vv. AN EXAMPLE 
It is desired to synthesize the two variable reactance matrixt 


* Equation (54) is used to determine Z22(p), in preference to equation (50) since 
equation (54) is easier to compute. 

t+ Equation (84) is the same as equation (59) except that for the ze block 
equation (54) is used instead of equation (50) for the reason mentioned in the 
previous note. 

t This example was given by Koga, (see p. 50 of Ref. 8). 
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w+ Ve +) pst 
Wp, 8) = (p+s\(ps+1) pts 


ps —1 ps + 1 
p+s pt+s 


Since W,(p, s) has no p-independent or s-independent poles the first 
step 1 of Section IV need not be performed, and W,(p, s) = W(p, s). 
The least common denominator of the elements of W (p, s) is 


g(p, 8) = ps’ + (p? + 1)s +p 
[which is equation (6) ], and hence 
a(p) = p, a(p) = (p? +1), a,(p) = p, and r = 2. 


The least common denominator of the minors of W(p, s) is also g(p, s) 
and hence 





k = 6,[W(p, s)] = 2. 


In the expansion, equation (7), 





Wp, s) = Ap) + 2 


t=0 § 
by the formula of equation (16) or by long division 
1 |p? 1 p 
A_,(p) — A i "| ’ 
P Pp 
1 2 1 2 2 2 
Rie ae +1" p(p *). 
pe +l) po =A) 
1 2 1 3 47, 2 1 
Alp) aa + 1° p@’+ ) 
pP(p +1) pp’ — 1) 
1 8 6 4 2 1 6/,.2 
AGN Al + 3p + 4p + 3p +1 pw + 


pp + 1) p'(p’ — 1) 
T,-,(p) = T,(p) defined by equation (24) is 
— Pp +1) — pip?) | p(p+1) p(p?+1) 
1 | —pPe?+1) —pp?—-1) ! p(p?+1) p*(p?—1) 
DA a Se en ek are eae an ee ea . 


PY) PCED) —p(pP+1) | p+3ps-+4pt+ 3p +1 p(pt+1) 
pip): pp 1)! p(p?+1) p(p?—1) 
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The polynomial matrix T,(p) = p$T(p) is factored by the method in 
Ref. 16 as equation (48) 





p(p?-+1) —p(p?+1) 
Tip) = MM) = as) | ae wrk 
pi—pe+2p?—p+1 (p+ p3-+2p?+p+1) 
pi—p* (pp) 
gc pie —P(pt+1) m+p+2p?+p+l pi+pi } 
(2)*) p(p-+1) p(p—-1) —(pt—p'+2p?-p+1) = —(p*—p) 
The (4 X 2) matrix M(p) is partitioned as equation (45) 
M(@) = Pe 
M,(p) 
| pp + 1) —p'(p’ + 1) 
ot eee A i a re ge ay eee 
“pt — p+ —p +1 —@i +p t+ +p+D 
2 —(p* + p’) 


To find M-*(p), a left inverse of M (p), it is enough to find a left in- 
verse of Mo if it exists, since 


[M5" | 0] Fa a ee 
M, 


In our example k = 2 and M, is a nonsingular matrix and hence 
M-1(p) is given by 


ap) 1  f-p' +2) pe’ +110 
Mo) =.= 
PP = lis nip? +110 0 


From the definition of 2, equation (52) 


0 0 | 1 0 
i] 
; 0 0! 0 1 
Op) = | ---2--- [=== Sees 
=] Qe es 0 
| Dp 
| : 2 
‘Oe Si 3@ 2 
| 
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Using equation (54) 


202 = M"o'M 
p +1 _(p + 1° 
2p 2p 
_@-l pti 
2p 2p 


Hence the coupling network formed by p-type elements has the 
(4 x 4) matrix of equation (84) 





M, 
AL, a. 
Z(p) = : 
M 2 
——=2 mM0'M 
Qo 
p +1 pti pti 
2p . (2)'p (2)%p 


Pat +1 
. (2) (2) 
p+1 pti p+1i _(p+1)’ 





(2)'p (2) 2p 2p 
+l p-1 _@-) p+ 
(2)*p (2)? 2p 2p 


Z(p) can be verified to be lossless, and the given W,(p, s) can of course 
be realized as the impedance seen at the first two ports of Z(p) when 
it is terminated at its last two ports by unit s-plane inductors. 


VI. CONCLUSIONS 


The synthesis method for two-variable reactance matrices developed 
here, in general yields a nonreciprocal coupling network even when 
the given two-variable reactance matrix is symmetric, and if a 
reciprocal coupling network is desired, Koga’s method for generating 
a reciprocal network from the nonreciprocal one can be used.® This 
procedure generally yields a reciprocal network at the cost of in- 
creased numbers of elements of both kinds. 

This method of synthesis of two-variable reactance matrices has 
been successfully applied to the synthesis of lumped-distributed RC 


REACTANCE MATRICES 191 


networks which are important in microelectronics circuits.’ In prac- 
tice, the only laborious step in the synthesis procedure is the factoriza- 
tion of polynomial matrix in the desired form. Of great importance is 
the approximation of desired characteristics by rational functions in 
two-variables; any work in this area would greatly enhance the use- 
fulness of the two-variable theory. The synthesis problem of n-vari- 
able positive real functions, for which many applications can be 
found,’ can be reduced to the synthesis of (n+1)-variable reactance 
matrices.2+ 2? when n = 1 the two-variable method developed here 
gives rise to a new method of passive RLC synthesis, which is no more 
complex than the existing methods. 


VII. ACKNOWLEDGEMENTS 


The author expresses his deep gratitude to Professor R. W. New- 
comb under whose guidance this work was carried out. 


APPENDIX A 


Partial Fraction Expansion of W (jw, s) 


To guarantee the factorization of 7',-,(jw) as JM? we needed Lemma 
2, which asserts that 7’, is para Hermitian and that T,_,(jw) = 0. 
In the proof of Lemma 2 we used the fact that R;(jw), the residue 
matrices of W(jw, s), are positive semidefinite. The proof is given below. 
Under the assumption that W(p, s) has no p-independent of s-in- 
dependent poles, for each real w the s-plane poles of W(jw, s) are simple 
and restricted to the imaginary s-axis by Theorem 1. Hence, for any 
fixed w, we can write W(jw, s) as 
Wo, 8) = An(ia) + YAO 


La = jnlle) 85) 


where the a;(w) are real and the R#;(w) are the residue matrices at the 
poles ja;(w). As in equation (9), r is the s-degree of g(p, s), the least 
common denominator of the elements of W. 

By complex conjugation on both sides of equation (85) 


Wj, 9) = ALG) + Dee. (86) 


Since W and A_, are rational in jw, 


W* (ja, 8) = W(—jo, s*) 
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and 
A*,(jw) = A_,(—je). 
Hence equation (86) becomes 


. R*(@) 


W(—jw, s*) = A-i(—jo) + Da) (87) 
and 
—W(—jw, —s*) = —A_\(—jo) + > eee (88) 
Since equation (88) is an identity for s*, we have 
-W(-ia, -8) = -A.(-#) + YR. (89) 


From the definition of a two variable reactance matrix, 
W (jw, s) = —W(—je, —s) 
and by equation (19) 
A_,(qw) = —A_.(—je). 


Hence, by comparison of equations (85) and (89) we have the desired 
result 


R;(w) = R¥(). (90) 


To show that the R;(w) are positive semidefinite for each w, we first 
notice that if 


VP, 8) 
W(p, 8) = 
P18) g(P, 8) 
where y(p, s) is a polynomial matrix and g(p, s) is the least common 
denominator of the entries in W, F,(w) in equation (85) is given by 
(see p. 308 of Ref. 19) 


_ Wp, 8) 
#@) = sop, 8) ; oe 
ds p=jw 


s=jas(w) 


Denoting dg/ds by g, and dy/ds by y, , for any n X 1 constant matrix 
X, (p. 39 of Ref. 8) 
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X*pX 


8 


X*R 4 = 
peso 
s=azas(w) 


_ | SAX = gE"v.X) |" 





(K*yX)" raya) 
- [26]. 
ee a, 
Hence, if X*WX # 0 
X*R,(w)X = [2 away | (92) 
A 


From definitions 1 and 2, and Theorem 1, X*WX is a two variable 
positive function and for Re p = Res = 0 


Re [K*WX] = 0 
and 
© (xX«WX) = 0. 
Os 
Hence 
2 Wx)? = SW (X*WX) = 0 
for 


Rep = Res = 0 


and consequently the left side of equation (90) is nonnegative. 
Thus we have proved that the residue matrices, R;(w), are positive 
semidefinite Hermitian for each w. 


APPENDIX B 
Proof of Theorem 8 
Theorem 3: If 


_ Bo(p)s’ + By(p)s'* + +++ + B,(p) 
Was do(p)s” + a,(p)s"™* +--+ + a,(p) 


ts a two variable reactance matrix, then for alli = 0,1, +--+: r 
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B;/a; is a reactance matrix in p. 

a;/Q:+, 7s a reactance function in p. 

a; has all its zeros on the j axis, and these are simple. 

XB;X/XB;.,X for all constant n X 1 vectors, X, is a reactance function 
in p. 


Proof: For any constant n X 1 matrix X, 


(X*B,X)s” + (X*Byx)s" | + +++ + (K*B,X) 
aos’ + as” * + +++ +, 


is a rational function in p and s with possible complex coefficients. 
For convenience, if we define 


b; = X*B,X 
wp, s) = X*WX 
f(p, s) = bos’ + bys" * +--+ +0, 


X*WX = (93) 


and as before 


g(p, 8) = aos” ++ ays” + +++ +a, 


equation (93) can be written as 


w(p, 8) = ae (94) 


From the definition of a two variable reactance matrix, w(p, s) is a 
two variable positive function, and hence for any po with Re po > 0, 
w(po , 8) is a positive function of s.°” Consequently, for all s with Res > 0 


So 5 8) 
Re v8) = v3) 


Since equation (95) has to be satisfied for all s with Re s > 0 and hence 
for arbitrarily small s, it can be seen from equation (93) that 


Re b, (Do) > 0 
a,(Do) 


for all p,) with Re po > 0. Hence, B,(p)/a,(p) is a positive real matrix 


and since W = —Wt 
2-2) 
a a, 


and thus B,/a, is a reactance matrix in p. 
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If instead of starting from the positive function f(po, s)/g(po, 8), 
we start from 0’f(po, s)/ds’/0’9(po , s)/ds’, which is also a positive 
function” for all 0 < v S 1, the same arguments used in proving that 
B,/a; is a reactance matrix can be repeated to show that B;/a; is a 
reactance matrix in pforallO Sir. | 

Now to show that a;,,/a; is a reactance function in p, we can use a 
similar proof based on the fact that a’~*g(po , s)/ds”*/d’g(po , s)/ds" 
is a positive function.” 

Again, it can be seen from the fact that 


a”-*g(po » 8)/ds"* 18g (po , 8)/A8” 


is a positive function.” that b;,,/b; ‘isa positive function satisfying 


e)--Pe] 
b; 4 b; : 


If X in equation (93) is chosen real b;,,/b; will be real for real p and 
hence XB;.,X/XB;X for any real n X 1 matrix, X, is a reactance func- 
tion. 

To see that the zeros of a; are all simple and restricted to the imaginary 
axis: if any one of the a; has a double zero on the jw axis or a zero off 
the jw axis, from the reactance nature of a;,,/a; for allO S 7 S 1, 
all the a; must have the same zero, and, consequently, W(p, s) will 
have an s-independent pole contradicting our original assumption that 
W has no such poles. 

We have thus proved all the claims of Theorem 3. 


APPENDIX C 
Proof of the Minimality of the Realization of W(p, s) 
in Both Variables 


In this appendix we show that the realization of W(p, s) that Sec- 
tion III gives is minimal in both the 7p and s variables. From the def- 
initions of 8,[W(p, s)] and 8,[W(p, s)], it can be shown that if 
W (p, s) is finite at p = © ands = o, 


8,1 (p, )] = a.Ln(p, 8)] 
8,{W(p, 3] = 8{n(p, 8)] 


where tne two variable real polynomial 


CAG = AOE ae ee aly % S60) 
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is the least common denominator of all the minors of W(p, s). The 
form in which 7(p, s) is written in equation (96) immediately reveals 
that 


5,[W(p, s)] = k. 
And if »(p, s) is written as 
a(p, 8) = co(p)p” + c(p)p™" + +++ +.¢,(p), (97) 
it can be seen that 
5,[W(p, s)] = m. 
c.1 Minimum Elements 


Next we would like to find the minimum number of elements of each 
kind needed in the realization of W(p, s). 


Lemma I states that k, the rank of 7,_,(p), is equal to the s-degree 
of W(p, s), and the realization obtained there uses exactly k s-type 
elements. By equation (4) 


W(p, 8) = 2u(p) + 212(p)[Z22(p) + sli]*z12(p). 


Suppose that there exists a realization with kp s-type elements, where 
ko < k = rank T,_,(p). Then, 


Wp, 8) = z1x(p) + 212(p)[2o0(p) + sl.J *Zi2(p) 


where the matrices 2;.(p) and Z..(p) aren X k, and ky, X ky , respectively. 
Then by equation (25), T,-,(p) = N(p)N‘(p) where 


£12 (p) 
N(p) = 212(P)Z20(p) 


Z12(p)Z22 (p) 
is an mr X ky matrix and hence, rank N(p) S k,. Also, we have 
rank T,_,(p) S rank N(p) S ky < k = rank T,.,(p) 


which is a contradiction, and hence k = rank T,.1(p) = 8 [W(p, s)] 
is the minimum number of s-type elements required in any realiza- 
tion. Now by repeating the same argument with a realization of 
W(p, s) where p-type elements are extracted instead of s-type ele- 
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ments, we can see that any realization must contain at least m p-type 
elements where m = §,[ (p, 8) ]. 


c.2 Minimality of the Realization in Section III 

We next discuss the minimality of the realization in both p-type 
and s-type elements. For the purpose of realization, the reactance 
matrix W(p,s) was decomposed as 


W(p, 8) = zu(p) + Z12(p) [Zo0(p) + sle]7*z10(p) (4) 
where 
Zp) | Z11(p) ee (11) 
—2;2(p) Z22(p) 


can be realized as the impedance matrix of a lossless (n + k) port 
in the p-plane. W (p, s) is the impedance seen at the first n ports when 
the above (n + k) port network is terminated with unit s-plane in- 
ductors at its last k-ports. Since k is the s-degree of W(p, s), the 
realization uses the minimum number of s-type elements. To show 
that the realization uses the minimum number of p-type elements, 
we have to show that 8[Z(p)] = 8,[W (yp, s) ]. For this we need a rela- 
tionship that exists between the least common denominator of the 
minors of W(p, s) and the determinant |z20(p) + s1,|. 

Every minor of [Z22(p) + s1;]- can be expressed as p(p, s)/¢(p, s) 
(See p. 21, of Ref. 12, Vol. 1) where p(p, s) and ¢(p, s) are polynomials 
in s with coefficients from the field of rational functions in p. Further- 
more, 


g(p, 8) = | Ze2(p) + sl. | 
is a monic polynomial in s of degree k. 

Since W(p, s) has no p-independent or s-independent poles, every 
zero of n(p, s) is a zero of ¢(p, s), and since k = 8 [¢(p, s)] = 
Ss[y(p, s)], y(p, 8) and y(p, s)/d.(p), which are monic polynomials in 
s with rational functions of p as coefficients, must be identical. Hence 


|2n(p) + sl, | = 52. (98) 


To show that 6[Z(p)] = 6,[W(p, s)] (since we already know that 6[Z(p)]= 
6,[W (p, s)]) itis sufficient to show that 6[Z(p)] S 6,[W(p, s)]. To establish 
this inequality, consider the matrix S(p, s) defined by 


S(p, 8) = [Z(p) — slasillZ(p) + slasil’. (99) 
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When s = 1, S(p, s) is the scattering matrix of a lossless network, 
since Z(p) describes a lossless network and (see p. 184 of Ref. 11) 


§[Z(p)] = 6[S(@, 1)]. (100) 
Since S(p, 1) is para unitary (see p. 131 of Ref. 15) 
{S(p, 1)] = {| S@, Y) II. (101) 


Equating the determinants of matrices on both sides of equation (99) 
= | Z(p) me, Slisk | 
| S(p, 8) | > | Z(p) ae slaaz | 
Using a formula from the theory of determinants (see p. 46 of Ref. 
12, Vol. I) 
| S(p s) | | (Zia — sl,) + Zinlaes = s1,)7'Zt2 |- | Zo — Sl, | 
| (214 + sl,) + Z12(Zo2 + sl,)' 215 |- | Zoo + Sl, | 
_ |W, —s) — sla | | 22 — she |, 
|W(p,s) +s1, | | 222 + 81, | 
Now if |W(p,s) + s1,| is written as 


(102) 


| W(p,s) + sl, | = ne (103) 


where h(p, s) is a real polynomial in 7p and s, since the left side of 
equation (102) is finite at p = 


5,[h(p, s)] S 5,[n(p, 8)] = 5,[W(p, s)]. (104) 
Substituting equations (98) and (103) in equation (102), we have 


| S(p,s) | = hp, —8) (Dp, 8) 8) np, —8) —s) do(p)_ 


nlp, —8) h(p,s) do(p) —a(p, 8) (105) 
_ hp, 8) 
h(p, 8) 


and by equations (100), (101), and (104) 


[Z(p)] = Sp, 1)] S 5,[W(, s)]. 
We have thus shown that 8[Z(p)] = 8,[W(y, s)]. 

It should be noted that Z(p) is the impedance matrix of any loss- 
less coupling network in a realization of W(p, s), minimal in s, and 
hence we come to the important conclusion that if a realization of 
W (p, s) is minimal in one of the variables it is automatically minimal 
in the other variable. 
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Computation of the Noncentral Chi-Square 
Distribution* 


By G. H. ROBERTSON 
(Manuscript received July 17, 1968) 


This article gives a formula that allows accurate values of the cumulative 
noncentral chi-square distribution to be computed. Although this distribution 
has been recognized for a long time, none of the standard references give 
formulae that are suitable for computing accurate values over an extensive 
range of the parameters; approximations in terms of the chi-square dis- 
tribution are usually recommended. A program writien by the author, based 
on the formula given here, has been successful for computations involving 
more than 10,000 degrees of freedom. Since many steps are required when the 
degrees of freedom are as large as this, the program is not “‘fast” but tt is 
believed to be accurate. 


I. INTRODUCTION 


The Non-Central Chi-Square Distribution is encountered in many 
statistical problems, one of the most important in communications 
studies being the detection of signals in noise using a square-law 
detector.1 Marcum discussed this application but concluded that a 
satisfactory algorithm for computing system performance could not 
be based on the formula he used.? This article shows that a satisfac- 
tory algorithm can be based on the formula that Marcum derived if 
the expression is expanded in a power series and the terms are properly 
grouped before being evaluated. 

More recently Urkowitz’? discussed detection system performance 
in which the above distribution arose and recommended that approxi- 
mations in terms of the chi-square distribution, given by Patnaik,2 
be used for computation. While these approximations are adequate 
for some purposes, it is desirable to have a reliable and accurate 
method of computing values, if only to check the approximations. 


* This work was supported by the U.S. Navy under contract N00039-68-C-3584. 
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Il. INTEGRATION OF NONCENTRAL CHI-SQUARE DISTRIBUTION 


If the signal-to-noise power ratio is x for the sum of » independent 
samples of the output of a square-law detector, the following integral* 
gives the probability that the sum will be y or more. The variables 
are normalized to the variance of the individual noise samples, so 
the average signal-to-noise power ratio for one sample is #/y and the 
average output per sample is y/y. Considering one sample of noise 
to be the sum of the squares of two independent gaussian variables 
of unit variance, the integral is related to the noncentral chi-square 
distribution by the conversions given in equation (8). 


rr) (n-1) /2 : 
a= f(A exp (2 — aR) ae. (1) 


From (Ref. 4, Section 8.445) [,, [2 (zx)%] is the modified Bessel 
function 


_>__ar 
Int) = 20 RG bE ED 2) 
Thus 
0 RED PO ay apt dee RED [aap (ae 
245 tg St dae 
Wn 2k +1) © 88 ay + DG + 2) 


Notice that 


: exp (—2)e""? dz = T(n, 9) (4) 


the incomplete gamma function (Ref. 5, Section 6.5.3). Since 


- Pop Nears PaaS) Pee ’ exp(—) dt (8) 


equation (8) can be written 


_ exp (—2) 
Q = T(n) | re, y) 


a rT {P6, y) + (“ye exp (-} 


* Sometimes called the generalized Marcum Q-function. See Ref. 2. 
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+2{rG, y+ (4 n(n + 1) = n(n + 1)(9 + s)¥ oe »} 


+--+ and so on (6) 


an 


mn 


Summing the terms by columns gives 


Tin, y) , y" exp (-y) oP 
a (1) + T'(n) E a ae 


_y 7 x" 
Tey pas 


ees, ee ee, 
Fee IG ee) ai ) a 
+ ---+ and so on | (7) 


A satisfactory computing algorithm can be based on equation (7) 
where we notice that Q can be expressed as the sum of two parts, Q1 
= IT(y, y)/I(m) which is independent of x, and another part which 
we call Qo. ; 


III. DISCUSSION 


The Siar chi-square cumulative distribution can be written 
Q(x?|v, 4) (see Ref. 5, Section 26.4.25), where the distribution is in- 
tegrated from ,” to infinity, the number of degrees of freedom is y, 
and the noncentral parameter is A. This integral is the same as that 
given in equation (7) if we put 


yp = 2n 
m= Qy (8) 
A = 2x 
so that 
Q(2y | 2n, 2x) = Qi + Q | 0) 


= QQy | 2n) + Q2 
where Q(2y|2n) = = Q(x?|v), the cumulative chi- “square distribution 
(see Ref. 4, Section 26.4.2). 
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If M independent samples of the output of a square-law detector 
are averaged, when the input is narrowband gaussian noise plus a CW 
signal at the center of the band, Q can be used to find the probability 
that a threshold value will be exceeded. Expressing all parameters in 
units of the narrowband noise power, the desired threshold is y/n, x/y 
is the signal-to-noise power ratio, and M = 7. 

It is interesting that the Rayleigh distribution, and the Rice dis- 
tribution, are equivalent to the chi-square and non-central chi-square 
distributions respectively, when the latter are expressed in terms of a 
parameter x equal to the square root of x?, and y = 1. 

Marcum? gave an expression of the form shown in equation (3) 
for the output of a square-law detector. He stated that it could only 
be used satisfactorily for values of » up to about 10. More recently 
Urkowitz? has discussed the integration of a square-law detector out- 
put and recommends that the noncentral chi-square distribution be 
computed using an approximation given by Patnaik? in terms of the 
chi-square distribution. Patnaik compares with exact values some 
results computed using the approximation and finds errors of the 
order of 1% around Q = 0.5. The accuracy is much less for values 
around unity and for values less than 0.01. 

Brennan and Reed have shown that, when the order of the Bessel 
function in equation (1) is zero, corresponding to one sample, a 
straightforward recursive method applied to the resulting equation 
(6) can be used to compute the integral.© They suggested that a 
similar procedure could be used even on the form of equation (1) 
given here. However, as pointed out by Marcum, such a technique 
rapidly becomes useless as y increases above about 10. 

A program written by the author, based on equation (7), has been 
used satisfactorily for » as large as 8192, and simultaneously for 
values of x/y up to 0.1. The exact values given by Patnaik were 
checked. Further checks were made possible by the development of a 
uniform asymptotic expansion by 8. O. Rice, with which it is possible 
to get results outside the useful range of the algorithm given here.” 

Table 1 compares values obtained with the author’s program 
(CHISQ) and corresponding values supplied by 8. O. Rice using his 
uniform asymptotic expansion (UAE), with results obtained using 
the Patnaik? and Gauss approximations (Ref. 5, Section 26.4.29). 

The accuracy of the algorithm given in equation (7) decreases as 
z/y increases in the table, and the value in the last entry depended 
quite sensitively on the last digit of a 18 digit double precision con- 


TaBLE I—CoMPARISON OF COMPUTATION METHODS 


zsn y/7 n 1-CHISQ UAE PATNAIK GAUSS 
0.01 1.05 8192 0.999801547E-00 0.9998015E-00 0.999801544E-00 0.999809E-00 
0.05 1.05 8192 0.501464546E-00 0.5014645E-00 0.501467E-00 0.50110E-00 
0.08 1.05 8192 0.552623909E-02 0.5526235E-02 0.552472H-02 0.56050E-02 
0.1 1.05 8192 0.138627645E-04 0.1386275E-04 0.138700E-04 0.14803E-04 
0.11 1.05 8192 0.281186446E-06 0.2811860E-06 0.280145E-06 0.31438E-06 
0.12 1.05 8192 0.316387190E-08 0.3163860E-08 0.314279E-08 0.37651E-08 
0.13 1.05 8192 0.20004E-10 0.199969E-10 0.197750E-10 0.25799E-10 


NOILOAGIULSIG AUVNAOS-IHO 
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stant used in the program. Notice that even for the last entry, the 
value actually computed, (CHISQ), appears to be correct to 14 places 
after the decimal point. 


IV. EXTENSION TO A MORE GENERAL INTEGRAL 


A more general integral is obtained by writing, for example, the 
8th moment of the partial noncentral chi-square distribution, 


Q, = im o(2)" oe exp (—z — 2)I,-:[2(ex)*] dz. (11) 


xv 


The corresponding form of equation (4) is 


[2 exp (ae dz = TE) (12) 
where 
E=n+8, (13) 


and the corresponding form of equation (7) becomes 


Q = Ee exp (—a).FiGi 95.2) 


y** exp (—y) Sa" (E+ 1),-1 
+ TO E BE) 229 a), 


wu (E +.2),_, 
1G wae OP(O eee 


ee) ee oe (E+ 8), 
Tae Ne —p oP) ies rl (n + 3)e-1 


+ +--+ and so on |. (14) 


The confluent hypergeometric function ,Fi(a; 6; x) (Ref. 5, Section 
13.1.10) is defined by 





ch te az , aoa+ De , aat+ Ia +t 2)2° 
F(a; b52) = Lt at bb ++ 2! b+ Db+23!° 
ie (15) 
- or 


Equation (15) conveniently gives an example of Pochhammer’s sym- 
bol (a), (Ref. 5, Section 6.1.22), also used in equation (14). 
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The structure of equation (14) is closely related to that of equa- 
tion (7), so it can form the basis for a useful algorithm to compute 
the integral given in equation (11). 


Vv. ACKNOWLEGEMENT 


The interest of Mr. Stephen O. Rice in this work and his encourage- 
ment to publish the results, is gratefully acknowledged. 


REFERENCES 


1. Patnaik, P. B., “The Noncentral x? and F Distributions and Their Applica- 
tions,” Biometrika, 36 (1949), pp. 202-232. 

2. Marcum, J. I.: “A Statistical Theory of Target Detection by Pulsed Radar: 
Mathematical Appendix,” Research Memorandum RM-753, The Rand 
Corporation, July 1, 1948. Published also as: Marcum, J. I. and Swerling, 
P., IRE Trans., PGIT, /T-4, No. 2 (April 1960). 

. Urkowitz, H., “Energy Detection of Unknown Deterministic Signals,” Proc. 
IEEE, 66, "No. 4 (April 1967), pp. 523-531. 

. Gradshteyn, 1. 8. and Ryzhik, I. M., Table of Integrals Series, and Products, 
New York: Academic Press, 1965. 

. Abramowitz, M. and Stegun, I. A., eds., Handbook of Mathematical Func- 
toons, National Bureau of Standards, Applied Mathematics Series 55, 
Washington, D. C., 1964. 

. Brennan, L. E. and Reed, I. S., “A Recursive Method of Computing the Q 
Function,” IEEE Trans. on Inf. Th. (Correspondence), [T-11, No. 2 (April 
1965), pp. 312-313. 

. Rice, 8. O., “Uniform Asymptotic Expansions for Saddle Point Integrals— 
Application to a Probability Distribution Occurring in Noise Theory,” 
BS.TJ., 47, No. 9 (November 1968), pp. 1971-20138. 


oe & 


for) 


J 


Uniform Approximation of Linear Systems” 


By HARRY HEFFES and PHILIP E. SARACHIK 
(Manuscript received August 6, 1968) 


A method for reducing the complexity of the class of linear, time-varying, 
dynamic control systems ts developed where the approach taken is that of 
uniform approximation (that is, modeling for a region of initial conditions). 
The objective of the modeling procedure is to choose a linear time-invariant 
system of given dimension, that minimizes a ‘‘worst-case”’ type of error 
criterton. Some results from the theory of widths of sets in Banach space 
are used to obtain bounds on the optimal approximation error as a function 
of the dimension of the approximating system. The use of these bounds in 
choosing the order of the approximation is discussed. An example illustrates 
the use of the derived results. 


I, INTRODUCTION 


In the analysis and design of control systems it is often useful to 
have low order constant coefficient models for the system. The prob- 
lem of modeling linear systems by lower order linear systems has 
received considerable attention, but these analyses have usually been 
restricted to the modeling of constant coefficient systems. 

References 1 through 5 contain various approaches to the system 
approximation problem; however, these analyses are generally re- 
stricted to the modeling of constant coefficient systems or systems 
which are forced with a given input or initial condition. 

The control system analyst often finds himself dealing with non- 
stationary systems, but little work has been done in the area of 
optimally modeling this class of systems. The emphasis here is on 
modeling the class of linear, homogeneous time-varying systems with 
constant coefficient models. Reference 6 considers approximation of 
forced systems. Rather than design the model requiring solutions of 
the actual and approximate systems be “close” for a prescribed initial 


*From a dissertation written as part of the requirements for a Ph.D. degree, 
New York University, 1968. 
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condition, the approach taken here is that of uniform approximation. 
Initial conditions are assumed to lie in some set in Euclidean space 
and a “worst-case” type of error criterion is defined. This eliminates 
tuning the model to specific conditions which may not be met when 
using the model. The material presented here thus generalizes previous 
work in that it extends the class of systems considered to time-varying 
systems and generalizes the error criterion to handle the more realistic 
problem of modeling for regions of initial conditions. 

The problem is of importance, for example, in trajectory analysis 
where the linear time-varying system is obtained by linearizing a set 
of nonlinear equations about a nominal trajectory. In this case the 
time-varying nature of the system is described by partial derivatives 
evaluated along the nominal trajectory. Solutions to the resulting 
equations require simulation for each set of initial conditions. Using 
a constant coefficient model eliminates the need for repeated simula- 
tion. 

The above example illustrates the use of a simplified model in 
analysis. The designer is interested in reducing the complexity of 
high-order nonstationary control system plants since this provides 
a means for designing simpler controllers based upon the model de- 
scription. The results presented here not only allow one to obtain 
stationary models but simultaneously offer the opportunity to obtain 
lower order models of the original system. 


II. PROBLEM DEFINITION AND FORMULATION 


The system we are considering is described by the linear, time- 
varying, homogeneous vector differential equation 
é) = A®z® (1) 
with the outputs given by 


y@) = Cx) (2) 
where 


x(t) is an n-vector 


A(t) is an” X n matrix whose elements are bounded and piecewise 
continuous on [é, , ¢,]. 


C() is an m X n matrix whose elements are bounded and piecewise 
continuous on [é, , ¢,]. 
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It is desired to obtain a constant coefficient system of kth order* 
(k 2 m) 


&(t) = Ax(t) (3) 


such that the first m components of the state vector £(£) closely approxi- 
mate the components of y(t) over the finite time interval [t, , ¢;]. Writing 
| GQ) = Ca@ (4) 
with 
C = [I mXm : 0] 

the approximation problem can be viewed as choosing the elements of 
the k X k matrix A such that 7(¢) approximates y(t) over [f, , ¢;]. 

Since, in general, it is not known at the time of modeling what initial 
conditions will exist in the system, it is desirable to have the approxi- 
mating system depend on a prescribed range of initial conditions rather 
than being tuned to any specific initial condition. The initial conditions 
are considered to lie in a closed, bounded convex subset of Euclidean 
n-space. That is, 


xt,)eRCEH,, 
and the performance criterion is ie by 
J(A) = max r min . “yy — 7’ WOYy — H at (5)T 


where 


[t. , ts] is bounded 
y(t) is the solution of (1) and (2) with z(é,) = x, 
§(t) is the solution of (8) and (4) with #(,) = &, 
W(t) is positive definite and bounded for all ¢ « [t, , ¢;]. 


The above performance criterion corresponds to the worst case error 
in the approximation, corresponding to a given model, when the initial 
condition on the model, £(¢,), is chosen optimally in terms of the initial 
conditions on the actual system. The modeling objective is to choose 
A to minimize J,(A)(that is, minimize the maximum approximation 
error). 

The approximation problem will be cast into a Hilbert space setting 


* Notice that k is not restricted from above. It may be desirable to have k 
> nif the original system is time-varying. 
t In all that follows the prime denotes transpose. 


212 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1969 


which will permit the use of many of the general results to be presented 
in the next section. Vector spaces of solutions of the original system 
equations and any member of the class of approximate system equations 
are established. These spaces are then imbedded into an encompassing 
Hilbert space. It then is shown that the problem of finding an optimal 
approximation can be viewed as a problem of finding the “‘best’’ subspace 
(of a given form) of the Hilbert space to use in approximating solutions 
of the original system. Writing the output vector of the original system 
in terms of the transition matrix leads to 


yQ = COX t.)x@.) (6) 


where the transition matrix ®(t, t,) satisfies 
Bt, 1.) = ARE, t,) @ 


with initial conditions 
P(t, ,t,) = I. (8) 


Now if the original system is completely observable’ on the finite 
interval [é, , t;] the columns of the m X n matrix C(é)®(i, t.) are linearly 
independent as vector-valued time functions. That is, 


CHE, t.)a(t.) = 0 forall tet, t] 


implies x(t,) = 0. For an observable system, the initial state can be 
determined uniquely from knowledge of the output. Since z(¢,) = 0 => 
y(t) = 0 and, from observability, y() = 0 = x(,) = 0 the linear in- 
dependence of the columns of C(1)®(, t,) follows. 

Let y be the linear space spanned by the n columns of C(t)®(, ¢,). 
The solutions of the original system lie in y, which is of dimension n for 
an observable system. Notice that the number of components (m) in 
the vector y and the dimension of the space Y need not be the same. If 
the system is not completely observable on [é, , t;] the dimension of ¥Y is 
less than n. 

The solutions to equations (3) and (4) can be written as 


H(t) = Cea“ 3(t,) (9) 


where 
iz ) A‘(t a ay 
A(t-—to) __ a 


a! 


and 
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ad Alt-to) __ A(t-#o) 

qa’ = Ae : (10) 
It is thus seen that solutions §(¢) lie in the vector space spanned by the 
k columns of the m X k matrix Ce*“'~'*, Denote this vector space as 
y,. If A is such that the approximate system is observable then ‘Y;, is 
of dimension & and the k columns of Ce*“~‘® form a basis 


{g.57 = 1, vee yk} 


for the k-dimensional vector space Y, of approximating solutions. These 
basis elements can be written as 


gh = 0 OR, (11) 
g: = {g.);telt. , tJ} (12) 


where K; is the 7th column of the k X k identity matrix. If the approxi- 
mation is not observable the dimension of ‘Y; is less than k. In any case 
vector spaces Y, with basis elements of the form (11) characterize the 
approximating systems where A isa k X k real matrix. Defining 


OD. = {Yej Bi, °°° , Se span Y;} (13) 


where g;(é) is given by equation (11) and A is any real constant k X k 
matrix casts the problem into finding an element of D, minimizing J, . 

The problem of finding an optimal approximation has been cast into 
the problem of finding an extremal space Y% « D, of approximating 
solutions. A Hilbert space 3¢ containing y and all members of 9, will 
now be constructed. _ 

Recall that the elements of Y and Y;, are real, vector-valued, time 
functions having m components. Thus each element of the Hilbert 
space 3C to be constructed will have m components. The inner product in 
5C is defined by 


wi s¥) = [now Ould at (14) 


where W(t) is a real symmetric m X m matrix which is positive definite 
for ¢ ¢ [t, , t;] and whose elements are bounded for ¢ « [é, , ;]. Notice that 
this is the same matrix appearing in the performance criterion given 
by equation (5). The norm of an element in JC is given by 


livll = @, y)#. (15) 
The Hilbert space 3 is defined as 
5c = {y; y has m components, || y || < ©} 
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where || y || is given by (15) and the inner product given by (14). 
Since 
ly aoe t, < 0 


and the elements of A(t) and C(é) are bounded it follows that solutions 
of equations (1) and (2) are bounded thus yielding 


YC a. 
Since elements of Yj, are bounded over the finite interval [é, , t,] 
Yi, C KH. 


That Yj, and Y are subspaces of 3C follows from the fact that any finite- 
dimensional linear set in a normed space is closed’”. 

The set of functions to be approximated are solutions to the original 
system equations with the initial conditions x(é,) satisfying 


ti) eh Ci. 


where Ff is a closed, bounded convex subset of Euclidean n-space. 
Writing 


F = ly; y@) = CO&, t.)x(.), x(t.) « BR} (16) 
gives 
JA) = max min lly —F II (17) 


where the modeling objective is to find 
d; = inf max min || y — § ||’. (18) 
UreDe. yeF Fey 
Before proceding to solve the formulated approximation problem, 
some results from the theory of widths in Banach space are outlined. 
Lower bounds on the optimal performance are found as a function of 
the dimension of the approximating system. 


III. WIDTHS OF SETS IN BANACH SPACE AND LOWER BOUNDS* 


Classically, approximation theory was concerned with the follow- 
ing problem. Given a function to approximate and a set of approxi- 
mating functions (sinusoids, exponentials, and polynomials, for ex- 
ample) find that linear combination of approximating functions which 


* Ref. 7 contains an excellent. treatment of widths of sets in Banach space. 
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minimizes some distance function. Notice that here the approximat- 
ing functions are given as part of the problem statement. 

Rather than approximate a single function, the problem under con- 
sideration is to approximate the class of functions § given by (16). 
For a given class of functions § it is desired to obtain a “best’”’ set of 
approximating functions rather than to choose the set arbitrarily. A 
measure of comparison is introduced which enables one to evaluate the 
efficiency of different sets of approximating functions. The following 
definitions serve to illustrate these ideas. 

Let ® be a Banach space containing a set of functions § to be approxi- 
mated by elements of an n-dimensional subspace, X, , of @. It is desired 
to find the “best”? n-dimensional subspace, or equivalently the “‘best”’ 
set of approximating functions to use in approximating elements of §. 

For a given f ¢$ and X, C@ 

inf || f — x || 

zeXn 
represents how well one can do in approximating a given f with elements 
of X,. Taking the supremum of the above quantity over all elements 
in ¥ leads to the following definition. 


Definition 1: The deviation of F from X, is given by 
Ex,(&) = sup inf || f — = ||. 
feF xzeXn 
The deviation represents the worst case approximation error over the 
class § when using elements of X, . Notice that the deviation serves as 


a performance measure of X,. Taking the infimum of the deviation 
over all n-dimensional subspaces of ® leads to the following definition. 


Definition 2: The nth width of F is given by 
d,(S) = inf Bx,(5) 


XncB 


= inf sup inf || f— ||. 
Xno@ feF xceXn 


Some of the elementary results following from the above definitions are 
(t) The monotonicity of the width: 
dj(F) 2 dF) 2 4,5) 2 --- 
and 
(ii) The nested property: If $, C $2 C --- then 
d,(5:) S di(F2) S ---. 
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Notice that 

J.(4) = E%,G). (19) 
In defining d? the infimum of the square of the deviation was taken 
over the j-dimensional (j S k) subspaces in D, whereas in defining d, 
the infimum was taken over all k-dimensional subspaces of ®. Using the 
monotonicity property of the width, with 3¢ serving as the required 
Banach space, gives 

J,(A) = di) (20) 
for any k X k matrix A. 
Definition 3: U,, is a closed ball of radius r in X, if 
Us 120X, 7s || a let): 


The following theorem, by Gohberg and Krein, is proved in Ref. 7 
and will be found useful. 


Theorem: If Xn, 1s an (n + 1)-dimensional subspace of a Banach 
space ® and if U4, is the closed ball of radius r in X41 then d,(Uns1) = 1. 

This theorem and the nested property of widths can be used to obtain 
lower bounds on d,(S). This lower bound can be obtained by constructing 
a ball in an (x + 1)-dimensional subspace and choosing 7 such that 
U.+1 C §. Using the nested property then leads to 


r= dn(Un+1) = d,(S). (21) 
Since 
d. = dh (22) 


the radius of ball also serves as a lower bound on (J,)}. 


Lemma 1: Let ®(t, t.) and C(t) be the transition matrix and output 
matrix, respectively, of the original system (1) and (2). Assume this system 
to be completely observable on |t, , t;]. Let W(t) satisfy the previously stated 
conditions. Then the matrix 


vee / ” a(t, L)CYW(NC((t, t,) at (23) 


is positive definite. 
Proof: Consider the quadratic form 2{Mz» = || y ||? 2 O where 
x(t.) = 2%, thatis, y(t) = Cb, t,)xo. 
Now || y ||? = 0 = y@ = Oon[Z,, ty). 
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Since the system is observable y = 0 = x = 0. Thus J is positive 
definite. 
The following theorem provides the lower bound on the performance. 


Theorem 1: Let R be the closed region of initial conditions on the 
original system and let x(t.) = 0 be an tnterior point of R. Assume the 
system to be completely observable on [t, , t;]. Denote the boundary of R by 
dR and let 


p= min 2’(t,)2x(t,). (24) 

a(to)edR 
Let the eigenvalues of the positive definite matrix M be ordered \,(M) = 
A2(M) = --- = X,(M). Then the performance, for any k-dimensional 


approximating system, satisfies J,(A) = p’na1(M) for k < n. 
Proof: Let 
F = fy; yf = CH, t.)xt.), x(t.) © R}. 


A k + 1 dimensional ball will now be constructed which is a subset of 
¥. Consider the & + 1 dimensional ball of radius r 


Oss = {Y; y(t) = CO&(E, t,)a(to), x(t) & Ope Cc E,, || y || < r} 


E,,: and r will be chosen such that U,,, C . Since M is real and sym- 
metric it can be diagonalized with an orthogonal matrix T. Thus M = 
T’AT and 


ly |? = (Tx) Al Pa(t.)] = 2’Az 


where 
| aes 
Ay 0 
As[ + 
0 An 
and 
z= Tx(,). 
Defining 
Busi = {xt.); [Trt], = 0, t= hk +2,--+,n}. 
and 


r= p Nev (M) 
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gives 
Uses = fy; yd) = CWE, ta(t), 
ly IP S$ pu), [Pelt.)]; = 0 
=k+2,---,n}. 
Thus for y « U,4; 


ly I? = 2’) M(t.) = 3 Zid; Sp dest - 


t=] 


Since 
z; =0 t=k+2,---,n 
and 
Le 
—P2>] for +X k+1 
Newt 
we have 


It then follows, from the definition of p’ and the fact that zero is an 
interior point of R, that x ¢ R and therefore y « 5. Thus U;., C F 
and the desired result 


J,(A) = di(S) = p nvr (MZ) k<n 
follows. 


Remarks: Recalling that the eigenvalues of M are ordered, we 
notice that the lower bound is a decreasing function of the dimension 
of the approximating system. This result can be used to determine 
what order aproximating system (at least) need be considered to 
achieve a given performance. We emphasize that the bound depends 
on the original system and is obtainable prior to the modelling proce- 
dure. From an engineering viewpoint, if one has an approximating 
system whose performance is “close” to the bound it may not be 
necessary to seek the minor improvement. Notice that the only prop- 
erty of R appearing in the lower bound is p and no attempt was made 
to take the orientation of the set into account. The bound will there- 
fore be least conservative when F# is a hypersphere of radius p. 
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IV. EVALUATING THE PERFORMANCE FUNCTION 


In this section the problem of finding the performance (or equi- 
valently the deviation) of a given approximating system is considered. 
The optimal choice of initial conditions on the approximating system 
is obtained using some elementary Hilbert space concepts and it is 
shown that 


inf ||y — ¥ | 
FeV, 


is a positive semidefinite quadratic form in x(t,). Next, properties of 
convex functions are used to evaluate the performance for different 
classes of regions of initial conditions; namely, for ellipsoids and con- 
vex polyhedra. The Powell algorithm for minimizing a function of 
several variables, without calculating derivatives, is then outlined 
and applied to the system approximation problem. 


The problem of finding 
& = inf lly — ¥ Il (25) 
yeYy 


is equivalent to finding the best choice of initial conditions on a given 
approximating system characterized by Y, « D,. It can be shown” 
that there exists a unique j* © UY; (y* is called the projection of y in the 
space Y;,) such that 


& = |ly —F* I? = lly ll? — WH IP (26) 
Furthermore, since g; , £2, °° , Z, Spans Y, , ¥* has the representation 
k 
y* = > g ty 
i=1 
where 


(y, 1) 


I 


G(g: press g.)£* (27) 


y, &x) 
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and G is the Grammian of {g;;7 = 1, --- , k}, that is, 
[Gein hae Ae ka) 4 GF SA eee gk 


Any solution to (27) results in an optimum choice of initial conditions 
on the approximate system. If the g,’s are linearly independent (this 
corresponds to the system being observable) the Grammian is invertible 
and £* is unique. Thus 


(y, 21) 
a*(t.) =@(gi,---, eo} - (28) 
(y, &) 
where Gt is the pseudoinverse” of G. 
The Grammian is given by 
tro - i 
Cg, etn 2) LS i eV OIG) Cen dt (29) 
and 
(y, gi) = KF x(t.) (30) 
where F is given by 
tf = 
Re i of “tO CWT (C(t) B(E, to) dl (31) 
Using (80) in (28) gives 
£*(t.) = GF axi(t,). (82) 


Thus the optimal initial condition on the approximating system 
is obtained by linearly transforming the actual initial condition with 
the (k X n) matrix G'F. Using the orthogonality property (26) yields 


lly — # |’ = lly ll’ — 2°@.)Ga*(¢.). 
Letting 
ts 
M = / &'(t, t)C'()W(NC()4(t, t,) at (33) 
and using (32) and the symmetry of G (and thus G"*) gives 
lly — #* | = 2'@.)(M — F'G*F)x(¢,). (34) 
In summary, 


& = inf lly — ¥ ||? = x(t) Date.) (35) 
yess: 
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with 
D=M — F'G'F. (36) 


Thus, finding the optimal initial condition on the approximating sys- 
tems leads to the positive semidefinite quadratic form (35) for the 
approximation error. The above represents the first step in evaluating 
the performance of any given approximating system. 

Since D is a positive semidefinite matrix, §* defined by (35) is a 
convex function of the initial state x(t,). The following theorem from 
Ref.13 is useful in maximizing 8°. 


Theorem: If the absolute maximum of a convex function, defined 
on a closed, bounded, convex set, is finite then the absolute maxi- 
mum ts taken on at an extreme point of the set. 


Remarks: An extreme point of a convex set is a point in the set 
that cannot be written as a convex combination of two other points 
in the set. Notice that an extreme point is a boundary point; how- 
ever, generally not every boundary point is an extreme point. Thus, 
if one is seeking the absolute maximum of a convex function defined 
on a closed, bounded, convex set only boundary points need be con- 
sidered. Also if the domain of definition is a convex polyhedron (a 
closed, bounded, convex set with a finite number of extreme points) 
the absolute maximum can be obtained by simply evaluating the 
function at the extreme points and choosing the largest value. 

Two general classes of closed, bounded, convex regions of initial 
conditions are considered in this paper, the ellipsoid and the convex 
polyhedron. 

Let the region under consideration be an ellipsoid defined by 


R = {x(t.); 2'(t.)Bu(t.) S 1°} (37) 
where B is a positive definite, symmetric matrix and r is finite. Notice 
that R is closed, bounded, and convex. Now the constrained maxi- 
mization problem is one with an inequality constraint. Using the con- 
vexity of R and 8, the absolute maximum of the quadratic form is 
seen to take place on the boundary of the set R. Thus the performance 
can be written 

J,(A) = max x'(t,) Da(t,) 


z(to) 
subject to the constraint 
x’ (t,)Bx(t,) = 1’. 
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It can easily be shown that the x(t,) maximizing the quadratic form 
is the eigenvector of the matrix BD corresponding to the largest 
eigenvalue and the maximum is given by 


J(A) = Ymax(BDYr”. (38) 


A convex polyhedron is usually representative of the type of in- 
formation one has as to the range of initial conditions. As an example 
of this situation consider the original system to represent linearized 
equations of motion of a space vehicle. Suppose it is known that the 
range of initial conditions are in terms of bounds on position, ve- 
locity deviations, and so on. For example, 


| x,(t,) | S 100 feet. 


| x(t.) | S 5 feet per second. 


This particular region is described by a rectangular region in state 
space with the extreme points being the corners 


a | a | 
5) —5 5 —5 
In general for this type of initial condition region, that is, 


| x(t.) | S 0; t=1,--°,n, 


the region has 2” extreme points. Since 5? is an even function of x(t,) 
it is only necessary to consider 2”? extreme points eliminating from 
consideration the negative of any point considered. 

The convex polyhedron region also is important, for example, since 
it may be used to simply approximate a more complex region. In gen- 
eral, let 


eo? ¢=1,2,-++,N 


be the extreme points of the convex polyhedron R. Using the con- 
vexity of 8? in the initial state x(t,) the absolute maximum 8? over 
R takes place at one of the «. Letting 


§? = xe!’ Dae 
where D is given by equation (86) leads to 


JAA) = max [61,625 «++ , On). (39) 
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V. MINIMIZING THE PERFORMANCE FUNCTION 


Since it is a fairly simple matter to evaluate the performance, whereas 
evaluating the gradient of the performance function requires significant 
computational effort, it is desirable to use a numerical procedure not 
requiring a gradient computation. Notice that J,(A) is not generally 
differentiable. Here, for completeness, the Powell method of minimizing 
a function of several variables without calculating derivatives is pre- 
sented."* Reference 15 contains a summary of the various minimization 
techniques available not requiring the computation of a derivative. 
See Refs. 14 and 15 for a more detailed description of the methods 
and their convergence properties. 

Consider a real, scalar, valued function of N real variables a, , +++ , ay 
written f(a). Powell’s iterative scheme concerns itself with finding the 
minimum of f(a) without computing its derivative. 

Each iteration of the modified Powell procedure starts with a search 
down N linearly independent directions 


M15 N72, °*°* » ON 


starting with an initial guess a, and defines a new set of directions for 
the next iteration. 

An iteration of the recommended procedure, suggested by Powell, is: 

(¢) for j = 1, 2, --- , N calculate d; such that f(a,-1 -+ Ajn;) is 

minimum and define a; = a;_; + 37; . 

(iz) Find the integer m, 1 S m S N, such that f(a,,_,) — f(a,,) is 
a maximum and define A = f(adn-1) — f(m)- 

(tit) Calculate f; = f(2ay — a.) and define 


f 1 f (a,) 
fo = f(y). 
(wv) If either f; = f, or 


(f, a 2fe 5 fa) (hh =< fe ~~ A)’ = ZAC, 7 fa)” 


use the old directions 7,, --- , ny for the next iteration and use ay 
for the next a, , otherwise 

(v) define 7 = a, — a, and calculate \ such that f(ay + An) is 
minimum. Use 


Ni) ° °° > Nm=-1 > 1; Nm+1> *** » IN 


as the new directions and ay + Xz as the new a,. 
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The performance functions, for the two classes of initial conditions 
being considered are given by (88) and (39) in terms of the matrix D 
defined in (86). The major effort in computing the performance func- 
tion is seen to lie in the computation of D. Sylvester’s expansion (see 
page 83 of Ref. 16) for computing e*' is useful in the computation of 
the matrices F and G. 

The basic procedure can be outlined as follows: 


(t) Compute and store C(f)®(t, t,) for te [t, , t)] using (7) and (8). 
(77) Evaluate M using (23). 

(zt) If it is desired to compute the lower bounds to aid in choosing 
the dimension of the approximating system, compute the eigenvalues 
of M and obtain the bounds from the result of Theorem 1. 

(iv) Choose starting values for A and choose the directions for the 
initial search in the modified Powell method to be 


1 0 0 
0 1 

’ 0 »° ’ 0 
0 0 1 


where the above are k’ vectors. 


(v) Use modified Powell method to determine the minimum of the 
performance function. Each element of the vector a in the Powell 
method corresponds to an element of A. 


VI. EXAMPLE 


A linearized missile guidance loop may be expressed in the form 


H 


Z = 2 tS 
1 2) 2 m— i°?? 


tz =U (40) 
where x, is the lateral position deviation from a nominal trajectory, 
x is the lateral velocity deviation, x3 is the attitude deviation in the 
given direction and wu is the control signal. The relationship between 
the attitude and lateral acceleration is given through the time-varying 
gain H/(m — t) which accounts for the loss of mass because of fuel 
consumption. 
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Suppose it is desired to approximate homogeneous solutions to 
(40) for initial conditions (at beginning of a stage) lying in a set 
FR (R is defined later) with solutions of a constant coefficient system. 
The actual system (40) can be written in the vector-matrix form 





z(t) = A(é)x(t) (41) 
with output 
y(t) = [1 0 Ole@) = Crh) (42) 
where 
2, (t) 
x(t) = | x(t) 
X(t) 
and 
0 1 0 
A(t) ={|0 0 2. ; (43) 
0 0 0 
Let 
ly II? - | y(t) dt. (44) 


Before proceeding to find the approximation it is instructive to 
determine the lower bounds on the optimal performance. This will 
naturally aid in choosing the dimension of the approximating system. 
The matrix, M, defined by (38), is given by 


T 
M = / &'(t, o)C’CH(t, 0) dt (45) 
0 
with 


£ W(t, 0) = AC ®(, 0). (46) 


The transition matrix, which is the solution to (46) with the identity 
initial condition, is given by 
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1 t HY ~ ) In (m= ) oe i 

















Be S10 2 —Hn(™=4) 
m 
0 0 1 | 
Evaluating M leads to 
T° T? (m—TYy (m=) Lee \] 
T 5 ae 5 In - +4 — 2mT) 
ye 
M Oo 2 M,, 
Mis Mig Mss ; 
with 
ANE 2 Dg 2 OP mma 2): 
Mos = E 36" 6 
1 __ ane 
a (™ — a 4 (m — THAT + Sm) | 
m 36 
and 


[T?  (m—T)* (m= 2) 2 ‘ 
Bo mn eee 


sn (SF) 4 5 im? — (m — 19} 








M3; = H? 27 
10,73 _ OF +m) __ mM? 
36" a. ea) 
m— T (m — T)(4T + 5m) 
“a ( m a 18 ft 


Let the constants defining the problem be given by 
m = 15 seconds (normalized mass) 
T = 10 seconds 
H = 15 (pound-seconds per slug ) x 107° 


and let the region of initial conditions be given by 
R = {x(0); |2,(0)| S 30feet, | x2(0) | S 2 feet per second, 


| v3(0) | S 1 milliradian}. 
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Evaluating M for the above values of the constants leads to 


10 50 =.206 
M =) 50 3838 1570 


206 1570 8082 
with eigenvalues 


Ai = 8398, deo = 3l, Ag = 1.1. 


We have 
Jo = 8,393 
J, 2 31 
and 
P oan Ue 


Here Jo represents 
max || y ||’. 
zoeR 


The second order approximation thus has the possibility of yielding 
a negligible approximation error. Thus in the remainder of this paper 
the optimal second order approximation will be sought. Thus 


and 


g = [1 Ole. 


The initial choice for A in the iterative procedure is 


4.=/ | 
0 0 


which represents polynomial approximations to solutions of the orig- 
inal system. 
The extreme points of R are given by 


30 —30 30 30 
gg? = 9 ; (2) 2 ; ge = | —9 and gy? _ 9 
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and their negatives. Thus 
JAA.) = max {2°?! De} = 340 
ty 


where D is evaluated from (36). It is thus seen that the performance 
function is far greater than the lower bound and the possibility exists 
for a significant improvement. The result of applying the Powell 
algorithm to this problem yields 


oe Re 0.827 
0.177 X 10° 0.629 x 10°? 
and 
J.(A*) = 33.4 
with the eigenvalues of A* given by 
\(A*) = 0.245 
\2(A*) = 0.30 X 107*. 


The above results are obtained after three iterations of the Powell 
algorithm. The G, F and D matrices are given by 


2 i 2 

771 2230 

ee 296.0 ea 
F= 

112.2 832.6 4241 


and 
44X 107° ~—1.8 X 10 1.2 X 10° 
D=|-18X10"% 7. 3.5 X 107°}. 
1.2 X 10° 3.5 xX 107° 4.2 
Evaluating 


max {a Dr } 
‘ 


gives the maximum approximation error occurring at the extreme point 
30 
ag a, =a) y 


1 
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Figure 1 shows the solution of the actual and approximate system 
for this worst-case initial condition. The solutions are obtained from 


y(t) = 30 — 2¢ + #,3(t, 0) 
and 
30 
g(t) = [1 Oje**G'F} —2}- 
1 
g(t) = 4.82 e' + 19.78 er 
The matrix relating the initial conditions is given by GF, that is, 
&(0) = G'F2(o). 


xo) =| 100 1.86 “Ee a. 
—0.295 —0.271 2.48 


SOLUTIONS IN FEET 





4 6 
TIME IN SECONDS 


Fig. 1 — Exact and approximate solutions in worst case. 
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VII. CONCLUSIONS 


A method for uniformly approximating solutions of linear, time- 
varying, homogeneous differential equations has been presented. The 
problem of approximating systems subject to control or reference 
inputs is considered in Ref. 6 for the class of exponential polynomial 
control inputs. 

One of the objectives of modeling with constant coefficient systems 
was to obtain closed form approximations. Use of Sylvester’s expan- 
sion allows one to derive these closed form expressions. However, 
more general classes of approximating systems can be sought while 
still maintaining the property that approximations are in closed form. 
For example, a general model of the form 


& = p(t) Ax 


where p(t) is a scalar valued function possesses the closed form 
solution 


eee | a . DG) ar [ott 


and p(t) as well as A may be sought as part of the modeling procedure. 
A complexity constraint can be imposed on p(t) by considering it to 
be a polynomial of given degree and the search for the model reduces 
to finding the coefficients of the polynomial as well as the elements of A. 
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A Second Order Statistical Analysis of the 
Operation of a Limiter-Phase 
Detector-Filter Cascade 


By W. D. WYNN 


(Manuscript received August 30, 1968) 


This paper presents a second-order statistical analysis for the cascade of 
a bandpass limiter, and ideal phase detector and a video filter. This cascade 
forms an important subsystem in the mathematical model of some coherent 
communication systems where information ts transmitted by phase or 
frequency modulation of the carrier. We derive the autocorrelation function 
R(t, , te) of the video filter response when the bandpass limiter input is a 
fixed amplitude-phase modulated carrier plus stationary gaussian noise. 
The video filler response ts wide sense stationary for some nontrivial cases; 
these include biphase, single tone, and stationary gaussian notse phase 
modulation. For these cases, we obtain the video filter output average power 
spectrum as the Fourier transform of R(r) for all values of the limiter input 
signal-to-noise power ratio. An application of the results of this paper ts 
the performance of a FM-PM demodulator for a set of parameters charac- 
teristic of one mode of operation of the Apollo Unified S-Band communica- 
tions system. We present the performance as a family of curves of subcarrier 
channel output signal-to-noise power ratio as functions of the limiter input 
signal-to-noise ratio where subcarrier phase modulation index is a param- 
eter. The approach is similar to the analysis by Davenport of the signal- 
to-noise ratio transfer characteristic of an tsolated bandpass limiter. 


I. INTRODUCTION 


In some coherent communication systems, such as the Apollo Uni- 
fied S-band system,! where information is transmitted by phase 
modulating a carrier, bandpass limiters? are used in the IF channels 
preceding the coherent demodulators. Ideally the bandpass limiter 
removes any amplitude modulation that might exist before the signal 
is demodulated. 
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Figure 1 shows a typical coherent phase demodulator used in such 
a system. This demodulator consists of a multiplication operation (a 
phase detector) with post-video filtering. The phase modulated signal 
is multiplied by a coherent carrier reference to yield a video signal 
containing the desired information. The signal into the limiter is 
usually accompanied by noise that is frequently assumed to be addi- 
tive and gaussian. The presence of the noise affects the performance 
of the demodulator in a very complicated way because of the non- 
linearity of the limiter. Thus it is difficult to evaluate the corruptive 
effect of the noise on the demodulated information. 


One criterion of performance at points in a communication system 
is the signal-to-noise power ratio (S/N). For the cascade in Fig. 1, a 
problem of interest to the systems engineer is the video filter output 
S/N as a function of the input S/N to the limiter when the input noise 
is additive, stationary, and gaussian. The relationship is known between 
input and output S/N for an ideal bandpass limiter where the input is 
the sum of a stationary gaussian noise and a signal P(é) cos (w,t + ¢) 
(see Ref. 2). For the analysis there, P(t) is a random process and is 
slowly varying compared with cos w,¢. The carrier phase ¢ is a random 
variable independent of P(t) with a uniform distribution over [0, 27]. 

It is not possible to apply the known 8/N transfer characteristic of 
the ideal bandpass limiter found in Ref. 2 directly to obtain the S/N 
transfer characteristic for the bandpass limiter-phase detector-video 
filter cascade. A knowledge of the form of the signal and the noise 
out of the bandpass limiter, and not just the S/N of this output, is 
necessary to determine the effect of the phase detector on the bandpass 
limiter response. 

To obtain the cascade S/N transfer characteristic we apply the 
mathematical tools used in Ref. 2. The form of the signal assumed in 
the analysis of the cascade is s(t) = P cos [w, + 6(t) + ¢] where P is 


REFERENCE 
P(t) cos [wet +6 (t) + 4] + Noise SIN (wet +) 
/ 








INFORMATION 
| OUTPUT 





| | | PHASE 


[__BAND-PASS LIMITER || _DEMODULATOR | 


Fig. 1—A coherent phase demodulator with IF ‘bandpass limiting in the 
presence of additive noise. 
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a positive constant, @(¢) is phase modulation that is slowly varying 
compared with cos w,t, and ¢ is a random variable representing the 
arbitrary initial phase of the signal carrier. The probability density 
function of ¢ is assumed to be uniform in the interval [0, 27]. The 
noise input to the bandpass limiter is assumed to be additive, stationary, 
and gaussian with zero mean and power spectral density N. The input 
noise, the modulation 6(t), and the carrier phase ¢ are assumed to be 
jointly statistically independent. For the following analysis, the limiter 
is assumed to be ideal with limit level |. The transfer function of an 
ideal limiter is defined by 


+1, x>0 
y=I@)=) 0, «=0 (1) 
le 2% =< :0, 


A coherent carrier reference sin (w,é + ¢) is assumed to be available 
for the demodulator where ¢ is the phase of the carrier. 


II. THE SECOND ORDER STATISTICAL ANALYSIS 


2.1 A Cascade Model when s(t) is Narrow Band Limited 


In order to obtain a S/N transfer characteristic for Fig. 1, the auto- 
correlation function of z(¢) is derived. When R,(¢, , t2) = R,(r) the aver- 
age power spectrum of z(¢) is defined by the Fourier transform of 2,(7) 
and the S/N transfer characteristic can be found. An analysis of the 
autocorrelation function of z(t) does not seem possible for general s(t). 
However, if the signal s(¢) is a narrow band-limited process such that 
the bandpass filters are narrow compared with the carrier frequency », , 
the response z(¢) should be the same with or without the post bandpass 
filter that precedes the phase detector. The response of the nonlinearity 
I(x) to an input 2) = s(t) + n(é) that is narrow band-limited about 
+w, is a family of terms narrow band-limited about the frequencies 
-t-nw, where n = 0,1, 2,3, --- (see equation 13-53, section 13-1 of Ref. 3). 
Any narrow band-limited input to the phase detector that is not about 
-+-w, will generate a phase detector response above the cutoff frequency 
assumed for the video filter. For a narrow band-limited x(t) the auto- 
correlation function of z(£) is obtained from the analysis of Fig. 2. 


2.2 The Derivation of the Autocorrelation Function of z(t) 


Assume that the input x(¢) is narrow band-limited such that Figs. 1 
and 2 yield equivalent z(t). The autocorrelation function R,(é, , t2) is 
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| 
x(t)=s(t)+n(t) 


REFERENCE 
SIN (wot +¢) 


Fig. 2— The narrow band equivalent receiver for the derivation of R-(t:, te). 


obtained by first deriving F#,,(t, , t) from the model in Fig. 2. Since z 
and w are related by the linear video filter, R,(t, , t.) follows directly 
from R,,(t, , t2). 

The Laplace transform solution of a zero memory nonlinearity with 
stochastic excitation is used to derive F,,(t, , £2.) (see Chapter 13 of Ref. 3). 
The limiter characteristic is 


One oo | [ _ fale) exp (0a) do + i fe) exp (ee) te | (2) 


where 


Ee [™ 


I 


lay I IG oi aes, + So Ret Su 


and 


ee [ ; OT OE 7 for HE [al <0: 


I 


The variable » = u + jv is complex with Relw] = u. The contours 
C, and C’_ are taken parallel to the v axis in the w plane with Re [w] > 0 
for C,. and Re [w] < 0 for C_ . For convenience /(x) is written symbolicly 
as 


Ice) = 5 |e) exp (eu) de (3) 
where equation (3) means the same as equation (2) when C., and C_ are 
not the same contours. 

Since w(é) = sin (wt + ¢)-l[x(Q)], the autocorrelation function of 
w(t) is 
1 2 
R(t, , &) = (54) [ flo) [ f(w2)H {sin @.t, + ¢)-exp (18, + 7) 
Td C Cc 


“Sin (w.fe + G): EXP (weS. + woe) } dw, dw» (4) 
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where s; = s(t;) and n; = n(t;), 7 = 1, 2. The order of complex integra- 
tion and the expectation operation have been interchanged to get equa- 
tion (4). For the assumed statistical independence of n(Z), 6(t), and 4, 
the expected value in equation (4) factors into 


E{sin @.t, + ¢)-exp (,8:)-sin (@.t, + )- exp (w282) } 
-exp 3[o°w, + 2R,(r)ww. + ows] (5) 


where 7 = ft, — ¢t,. The form for the cross correlation function 
E{exp (#7) exp (w2%2)} where n(t) is stationary gaussian noise has been 
used in equation (5) (see pp. 476-477 of Ref. 4). 

For the case where s(t) is narrow band-limited with respect to o, , 
the filter in Fig. 2 is a narrow bandpass filter, and is assumed to be 
symmetrical about w, . Then n(é) can be written as (see pp. 373-874 of 
Ref. 4) 


n(t) = x, cos w,t — x, sin w,t 


where x, and x, are statistically independent stationary gaussian random 
processes, and 


R,(r) = R,(r) cos w,7 (6) 


where &,(7) = R&,.(r) = R,,(r). For a narrow bandpass IF filter, the 
transform of #,(7) is lowpass with a narrow bandwidth compared to », . 
With the substitution of 


1, = 1, = t+, 





¢* =o + a,b, 
; _ exp (j6*) — exp (—3¢*) 
sin ¢* = 3j ; 
and 
exp [R,,(7)ww2] = T,,(ww.R,) exp (jmw,7) (7) 


(see Article 1, Chapter 3 of Ref. 5), equation (5) becomes 
—2) DL) [nlowl,) exp (jme.r)-H {exp (joer + j26*) 
BE exp (jor ~~ j2¢*) — exp (joe) — exp (—jw.7)] 
-exp [w,P cos (0, + ¢*) + w.P cos (6, + ¢* + .,7)]}. (8) 
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Since exp (jw,7) and exp [w,P cos (8. + ¢* + w,7)] are periodic in w,7 
with the period 27, the function 

exp (jmw,7)Li{ } (9) 


in equation (8) is periodic in w,r. Since R,(r) transforms to a narrow 
band-limited lowpass spectrum, the autocorrelation function of 2(¢) 
corresponds to the de component of the Fourier expansion of equation 
(9). With the substitution of 6 = w,7, the de component of equation 
(9) is 


at 6 
| 5, °XP (jmé)-E{  } 


+00 +% 27 dé 
>> ys IloP)-IiloP) Ba [ On 


r=—o k=—-0 


| Basexp [Lim +1+h)6+ 72 +7 4+ kt + j(ro, + k6,)]} 


+ Ey {exp [jm — 1+ k)b + (-2 +7 + k)o* + jr, + £62)]} 
— By {exp [jm + 1 + k)b + ir + k)o* + jr + £62)]} 


— Hgetexp [jm — 1 + k)6 + gr + k)g* + jr + roan) | (10) 


Since ¢* = w,t + ¢, d* has a uniformly distributed probability density 
function on [0, 2a]. The averages in equation (10) with respect to 6 
and ¢* follow. For example, the first average with respect to 6 and ¢* 
is zroifw-+kK+1+00rk+r+2 +40,andwhenk = -—1—m™ 
and r = —2 — k = m — 1 the double average is exp [(m — 1)6, — 
(m + 1)6,]. Equation (8) reduces to 


(=a) 20 TafosoaR,)| ISPS {exp [j(m — 1)@, — j(m + 1)6]} 


— Tne laa (exp jm + 1)0, — jm — 1)6)} 
= Inet I oGeeiyl (exp [0m + 1), — j(m + 1)6)} 
az Te TG) Ef exp [p= 1) 0) = fe noah |. (11) 


The terms in equation (11) for positive and negative m can be combined 
by noting that I_,,(«) = I,,(«). With the substitutions 


+00 m+2q m+2qapmt+2q 
Wy, We R; 


T,,(wwol?,) > » Qe! I'(m + qg ot 1) 


a=0 





(12) 
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and 
ine = Be | sda exp (G2) raloP) des, (13) 
the autocorrelation function of z(é) is 


oe Ret” 

Bll sh) = 3 Dy Le gemg GF mi)! 
(Rvicqemito(m +1,m+ 1, ty, b) 

+ Mey ogemlte(m — 1, m — 1, t , ty) 

— Ninst.2q+mlm—1.2¢emleg(m + 1, m — 1, t, , te) 


— Wi easel 4 otley Te — i, a 1, tL , to) | (14) 


ao fh m= 0 
2, m > 0 


and R,(A, B, t; ,t) = E{cos[A@(t,) — Bé(ét.)]} for any integers A and B. 


where 


III. THE CLOSED FORM SOLUTION FOR hy, ;, 


The autocorrelation function of z(t) given in equation (14) contains 
the constants h,, , where m + k are odd integers. For the ideal limiter 
characteristic of equation (1), there are closed form solutions for these 
parameters. Since f,(w) = l/w for Re [w] > O and f_(w) = 1/w for 
Re [w] < 0, equation (13) becomes 


1 


2 2 
—— Ret e. 
On lw” "I n(wP) exp ( 5 dw 


Din -& = 


eae (“s' 

ae is lo'"T(oP) exp (S-) do (15) 
where C_ is the contour (—e — jo, —e + 7) and C, is the contour 
(+e — jo, +e + 70). By the change of variable w = jx and the sub- 


stitution of I,,(2) = (j) "J.,.(jz), analytic continuation can be applied 
form = 0 and k = 0 to give 


Rk = = (jie [2 OD 7 (tP) exp | =" 5 "| a. (16) 


When m + k is even, the integrand of equation (16) is odd and f,,,, = 0. 
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When m + k is odd, the integrand of equation (16) is even and 


(™ $ a)" mtk —Pp’ 
er pe m+ Ze) an 


where a solution has been used for the integral 


ee * A (nP) exp | =S* =] de (18) 


in terms of the confluent hypergeometric function ,F\(a; 8; —x) (see 
equation A.1.49, p. 1079 of Ref.6). For the case when m and k are non- 
negative integers ,/",(m + k/2; m + 1; —2x) can be expressed in closed 
form in terms of first and second kind modified Bessel functions. A 
list of these expressions is given by Middleton (see equation A.1.31, 
section A 1.2 of Ref. 6). A collection of h,,,, in closed form for low order 
indices is given in Table I. For Table I, x = P’/2o’ is the input signal- 
to-noise power ratio into the limiter in Fig. (2). 

Any of the h,,,, in equation (14) can be found in closed form from 
Table I by using the recurrence relations 





Pun an aren : 














4 1 
Nint® ch = liga: — ain ED » Nga: k-1 + aoe - ed)m m,k~-2 3 (19) 
P k—m-—2 
nse aa = oe m,k ~~ edecihe 
Co Co 
Xk —m—2 
+ Em (20) 
and 
—k P? —k 
Nin .eee = a) Poy ae “4 Rin-2, ae ial . (21) 
Co o o 


Equation (19) is derived from equation (16) by using the Bessel 
function identity 


2m + 1) . 

Px J 
Equation (20) is derived through a by-parts integration of equation 
(16) and the application of equation (19). Equation (21) is derived 
through by-parts integration of equation (16). In the development of 
equations (19), (20) and (21), the integral in equation (16) is re- 


J ms2(aP) = m+i(tP) — J (xP). (22) 
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TaBLE I— CLosED FoRM SOLUTIONS OF SOME h,, .t 



























































ml k Ina ae, 
1/0 @ a, e 71 To(a/2) + I,(2/2)] 

oft] Berner - 
211 =o e777, (2/2) 

1|2 Bae o**UTd(0/2) — 1,(e/2)] 

slo| o aie "| 10/2) — (1 + re | - 
ol3 a eT — a)Ip(a/2) + 21,(2/2)] 

. 3 aires "| Toe/2) ss (1 ee Vr a/2)| 

4/3] oor 3 - el (44 sees ay (¢/2) — (1 + )r@/ | 

ile ar e713 — 22)Io(a/2) + (Qn — 1)I,(2/2)] 7 
3] 4 ane onl (1 4 1) i 0/2) = (1 ue B 4 2.)rcu/2) | 


Qv' 


@ ae L045; +4 2 Vraee/2)- (1434418 *8)7.(0/2) | 
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stricted to the half interval [0, oo) which is possible since the inte- 
grand of equation (16) is even when m+k is odd. 


IV. THE AVERAGE POWER SPECTRUM OF 2(t) 


The autocorrelation function of z(¢) given in equation (14) becomes 
time independent such that z(¢) has the average power spectrum S,(w) = 
F(R,(7r)] when R,(A, B, t,, t) = R,(A, B, r) for integers A and B. 
There are some important cases of @(¢) for which FR, is time independent. 

If 6 is a biphase modulation with @@) = + | @| that has a zero mean 
and autocorrelation function (see equation 9-42, section 9-2 of Ref. 4) 


por (1— lz), foe: je eer 
Ri(r) = (23) 
0, for |r|>T7 
then 
RA, B, ty , t) 
= cos A | @|-cosB | 6|-+ sin A | 6 |-sin B | 6 |-r4(7) (24) 


where 7(r) = R,(r)/| 6|? is the normalized autocorrelation function 
of @(t). Then R, is a function of 7 = t, — ¢,. 

For a single tone modulation given by @(f) = m, sin (w,t + £) where 
£ is a random variable with a uniform probability density function 
on [0, 27], a simple Bessel series expansion gives 


RA, B, ty, te.) = do ex en(Am)Jon(Bm,) cos (2nw,7) 


=0 


3 


= 3 End on-1(A IM) J on—1(Bm,) cos [(2n — 1)a,7] (25) 


n=] 
_ n=0 
& = 
2, n> 0. 
For the single tone modulation, R, depends only on the time difference r. 
If a(t) is the sum of tones 


where 


() = D> mysin (wt + 6) (26) 


where §&, p = 1, --- , N, are independent random variables with 
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uniform probability density functions on [0, 27], R» is again independent 
of time. 

If a(é) is a stationary gaussian process with zero mean, variance «7 
and autocorrelation function K,(r), then R,(A, B, t, , t:) = R,(A, B, 7). 
The second-order characteristic function for the stationary gaussian 
process is defined as (see equation 112, Chapter 7 of Ref. 4) 


E(exp {jlws O(E + 7) + «2, 0(t)]}) (27) 
exp [—4K (0) @; + wo) = Ko(r)a,we]. 


By, , 2 5 7) 


Then 
RA, B, t, , t2) = Real Part H{exp GA, — 7B6.)} 


= exp | -2 (A? + B | -exp [ABK,(7)] (28) 


R,(A, B, T). 


The validity of equation (14) depends on the narrow band-limited 
assumption for the modulated signal s(t) at the carrier frequency 
w.. For s(t) to be narrowband limited, the parameter values that the 
modulation functions can have are restricted. 


Vv. AN APPLICATION OF THE R, RESULTS TO THE PERFORMANCE OF A SUB- 
CARRIER CHANNEL 


A modulation technique sometimes used for communication is FM- 
PM where the carrier is phase modulated by a subcarrier that is in 
turn frequency modulated by the information waveform. The FM- 
PM signal is of the form 


sit) = P cos {wt + @ + m, sin [w,¢ + & + AM)]} (29) 


where P, w, , w, and m, are constants, ¢ and £ are independent random 
variables usually assumed to have uniform probability density func- 
tions over [0, 2], and \(Z) is the integral of the information waveform. 
In a typical application, w, >> w, and A(é) is slowly varying compared 
with cos ,¢. With these restrictions the information \() can be re- 
covered from s(t) with the receiver shown in Fig. 3. 

The purpose of the bandpass limiter is to remove the effect of varia- 
tions that might occur in P. For the ideal case where s(¢) is not per- 
turbed by noise, the subcarrier filter input z(t) is 
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REFERENCE 
SIN (act +¢) 


Fig. 3— FM-PM receiver with ideal bandpass limiter. 


a(t) 


I 


alee {m, sin [w,t + (4) + €]} 
- (30) 


7 2 > Joni(m,) sin {(2n — Dfert + (1) + §]}. 


If A(t) is slowly varying compared with cos w,¢, the information can be 
recovered with a subcarrier filter that passes only the first component 
of the sum in equation (30). For the noiseless case the subcarrier filter 
response is then 


—2 F.(m,) sin fot + MO + El. (31) 


After additional processing in a subcarrier demodulator, A(é) is obtained 
from equation (31). One criterion of performance of the receiver is the 
S/N out of the subcarrier filter as a function of the limiter input S/N, 
z = P’/2c’. Since X(t) varies slowly compared with cos ,t, the output 
S/N for the subcarrier filter is determined with sufficient accuracy by 
setting \(é) = 0. If \(¢¢) = 0, the subcarrier output S/N follows directly 
from equations (14) and (25). Substitution of equation (25) into equa- 
tion (14) gives the power spectrum 


5.0) 


ie ate (wit+é) 


= hy YS Shx(m)-Fleos Gn — Derr] 
+: (2) oho = ohoyJ (2m,)]"-F[r.(7)] 
+ (4)07hi: yD J2(2m,)-F[r,(7) «cos (nw,7)] 


+ (3)o" i » Ten—1(71) °F [r5(7) + Cos (2n — 1,7] 


+ (5) SS) efo*hied,(m,) — o7haJ,(3m,)]”-F [12 (7) - cos nw, 7] 


n=0 
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ar (32) [o°hos a o NosJ o(2m)]” -F[ri(1)] 


+ (H5)0°h33 D> J2(2m,)- F[r3(7) - cos nw, 7] 
n=1 
+ (ss) SS eo AogJn(2m) — o'lggJn(4m,)]? F[r3(7) - cos nw, 7] 
n=0 


a (sao hie > Jon~1(7My) -F{r}(7)- cos (2n — 1)o,7] 


eo 


+ (rhs) ©} alot .(m) — ohgsJ (8m) ]°- Flri(a) - cos nw, 7] 


n=0 


+ (rts) YS enlo*hgid (3m) — o'AssJ (5m) |" F[ri(r)-cosnw,7] (82) 
n=0 
where r, = R,/R,(0) = R,/o’. The approximation, equation (32), 
neglects all the terms of equation (14) containing the factor R?**” 
where 2¢ + m > 4. The terms in equation (32) are the significant terms 
of S,(w) for the single tone modulation. The spectrum in equation (32) 
is the weighted sum of terms of the form 


BieG) cosmae= = F[r'(2)] * Fleos mo,7] (33) 


where * is the convolution operation. Since F[cos mw,7] is a pair of 
impulses of weight 7 at --mo, , 


F[ri(7) cos ma,7) = 4[S8,.@ + mw, + S,,,.@ — ma,)] (84) 


where 


Sa) = F[ri(z)]. 


The first term in the spectrum of equation (32) is the signal content 
of z(t). All other terms of equation (32) correspond to noise alone or 
a combination of signal and noise. All terms of equation (32) except 
the first term are usually combined to give the interference (noise) 
spectrum at the output of the video filter. 

A computation was made for the subcarrier filter output S/N as a 
function of the input S/N zx. The following conditions are assumed for 
the computation. 


(t) The power spectrum of the input gaussian noise to the cascade 
in Fig. 1 is uniform over the bandwidth of the prelimiter bandpass 
filter. 
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(71) The prelimiter bandpass filter is assumed to have a gaussian 
transfer function such that 


2 2 
n(n) = exp Bel (35) 
(wt) The subcarrier amplitude transfer function is 
A A 


| H(jw) | = (36) 
0, all other a, 


where Aw < w). Also, w = 12.566 X 10° and w, = 6.484 X 10° are 
assumed. Substitution of equation (85) into equation (34) gives 


be 7 wr (w ‘| 
S,.n(0) = F[ri(7)] = wala)? exp |- rs (.) ‘ (37) 
From condition 777, the noise spectrum in the passband of the subcarrier 
filter is approximately constant when w = w,. The signal and noise 
powers out of the subcarrier filter follow from S,(w,). The signal power 
is 2h7,J7 (1m); the noise power is 


foo) 


a, |. [S.@) — 2hiodi(m1)-F (cos w:7)]-| H(jw) |? dw SS [Sl@,)]-2 Af 
where Af is the width of the subcarrier filter and where S/ is equation 
(82) with the first term omitted. The function 


Qiod (21) 
a[S2(@1)] 
was computed for x between 0.01 and 100 with m, as a parameter. The 


results of the computation are shown in Fig. (4). For a given m, and 2, 
the output S/N for the subcarrier filter is 7/2 Af-S(m,). 





VI. SUMMARY 


A general, second order statistical analysis is presented for the cascade 
of a narrow bandpass limiter, an ideal phase detector, and a video filter. 
In this analysis, the input to the limiter is assumed to be the sum of a 
stationary gaussian noise and a fixed amplitude phase modulated sine 
wave. The autocorrelation function of the cascade response is obtained 
as a function of the signal-to-noise ratio x at the limiter input, the nor- 
malized autocorrelation function of the lowpass equivalent for the 
limiter input noise 7,(r), and the phase modulation 6@(é). 

The cascade response z(t) has the autocorrelation function R,(t, , t2) 
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Fig. 4— The unit bandwidth subcarrier filter output, S/N normalized by x 
where 10° S x S 10’ and Mi is a parameter. 


that can be time dependent. However, for some important cases of 6(¢), 
R.(t,, t2) = R,(r), and the cascade response has the average power 
spectrum S,(w) = F[R,(7)] where F is the Fourier transform operation 
with respect to 7. The cases of @(¢) considered that yield R,(r) are the 
random biphase waveform 6= -+-| @ |, the single tone @(¢) =m, sin (w,t+8), 
and the stationary gaussian process with autocorrelation function K,4(7). 

The dependence of #,(¢, , #2) on the limiter input S/N appears in the 
h parameters. These parameters can be obtained in closed form as func- 
tions of the modified Bessel functions J)(v/2) and I,(#/2). The lower 
order h parameters encountered in the first few terms of the series for 
R, are found, and recurrence relations are derived through which 
higher order h parameters can be derived easily. 

For the modulation types that make R, a function of 7 alone, the 
power spectrum S,(w) is known for all values of the limiter input S/N z. 
Then the S/N can be derived in any frequency band at the output of 
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the video filter in Fig. 1 as a function of any S/N into the limiter. 

The performance of a subcarrier channel was considered where 6(t) = 
m, sin [wt -+ A(t) + €]. The subcarrier was assumed to be phase mod- 
ulated by a narrowband low pass process X(t). The S/N at the output 
of the subcarrier filter was obtained by computation of the approxima- 
tion of equation (32). For this example, a gaussian prelimiter bandpass 
filter was assumed. For this filter shape, r?(7) and its transform S,,,(w) 
are gaussian for all integers n. Some representative parameters from 
the Apollo unified S-band communication system’ were assumed. These 
were 


(<) A prelimiter noise equivalent bandwidth of 4 MHz. 
(iz) A subcarrier frequency of 1.024 MHz. 
(zit) A subcarrier noise equivalent bandwidth of 0.2 MHz. 
(iv) An input S/N range of 0.01 < x S 100. 
(v) A set of modulation indices m, = (0.2)k, k = 2, 3, 4, 5, 6, 7, 8, 
9, 10. 


The results are given in Fig. 4. The differential between subcarrier filter 
output S/N at low and high values of x is a monotonically increasing 
function of m, for 0.4 S m, S 2.0. The shapes of the curves are similar 
to that of the (S/N)./(S/N), curve obtained by Davenport.” 
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Multiplex Touch-Tone® Detection Using 
Time Speed-Up 


By J. F. O'NEILL 
(Manuscript received August 19, 1968) 


A signal may be read from a storage medium faster than the rate that 
would correspond to real time reconstruction of the signal; this process has 
been named time compression or time speed-up. Cheap serial shift_registers 
make time speed-up an attractive means to detect Touch-Tone~ calling 
(or other format) signals on a multiplicity of channels using a single 
detector. 


I. BACKGROUND 


Time speed-up (TSU) of a signal consists of reading the signal 
from a store faster than the rate at which it was recorded. (This is 
generally faster than real time reconstruction of the signal, thus the 
name). I propose this process for multiplexing several voiceband 
channels in time, so that one multifrequency receiver can detect 
Touch-Tone® signaling on a multiplicity of channels. 


Processing of a single signal using TSU configurations based on 
electric or acoustic delay lines (called prLt1c systems, for delay line 
time compression) has been done since the 1950’s.**? At Bell Lab- 
oratories, TSU is being investigated for multiplexing Picturephone 
visual telephone channels on slightly nonlinear microwave radio sys- 
tems.” 

The inherent simplicity and versatility of a digital TSU signal 
processing system is enhanced by the availability of inexpensive serial 
shift registers based on the insulated gate field effect transistor. These 
registers typically store 64 bits, and are sufficiently fast to permit a 
single detection circuit to serve eight to 16 Touch-Tone voiceband 
signaling channels or hundreds of channels in a low frequency ap- 
plication, such as 20 Hz ringing detection. 

The attractiveness of TSU multiplex tone detection is demonstrated 
by, and most of this article treats of, the Touch-Tone detection case. 
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If the system is realized as a digital multiplexer (concentrator) in 
tandem with an analog frequency detector, it will become apparent 
that the frequency detector can (but need not) be exactly the type of 
circuit now used, but with a scaling applied to all reactances. This 
scaling is to raise all spectral features by a constant factor which is 
the ratio of time compression. Thus, all the linear and nonlinear signal 
processing now used can be included in a TSU system and its per- 
formance would simulate that of present Touch-Tone receivers. By 
adding additional data smoothing, which could involve using the 
signal samples more than once, the present tolerance to digit simula- 
tion by speech can be exceeded. 


II. SIXTY-FOUR CHANNEL TSU RINGING DETECTOR 


An exploratory key telephone system must detect the presence of 
20 Hz ringing on 64 central office lines. This detection could be per- 
formed on each channel, but the availability of 64 bit serial shift 
registers has made centralized TSU detection economically more 
attractive. 

Figure 1 shows the TSU arrangement to be used in this exploratory 
system. A transducer V;,7 = 1,2... 64 at each channel slices (limits) 
the ringing signal and presents a rectangular wave at logic level to the 
sampling gates S,;. Binary data is sufficient to specify the input signal 
because it is basically a single tone; there are interfering tones from 
power line cross-coupling but are suppressed to a large extent by the 
larger 20 Hz signal and the limiting operation. (It will be apparent 
that a multitone format such as Touch-Tone signaling would not be 
well represented by binary coded signal samples.) The sampling gates 
load the long serial register SR1 with a sequence of samples V, from 
all the channels. The order of the samples is the same as the order of 
the channels: ...,1,2,...63,64,1,2,.... 

The register SR1 has taps every 64 bits, however, and at these m 
taps (including the input and output) the samples at any instant are 
all for the same channel, as shown by V,. These samples can be pro- 
cessed in a high speed detector, and the result registered in either a 
common or per-channel answer depository. A digital detector, for 
instance, could examine the m samples in a time consisting of a few 
logic gate delays. Alternatively, the m samples can be placed in an 
independent register SR2 as shown in Fig. 1, from which they can be 
clocked into an analog frequency discriminator of any type, such as a 
two-pole resonator. With this system, the SR2 read-out clock is in- 
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Fig. 1— 64 Channel TSU single frequency detector. 


herently a part of the detection process, since it controls the ratio of 
time speed-up; if the channels are sampled at rate f, per second and 
the SR2 register is read at kf, per second, then k is the ratio of time 
compression (and spectral expansion). 

Modifications to the Fig. 1 TSU tone detector permit detection of 
a single tone of unspecified frequency. To do this, more detectors 
could be added at the output of SR2. The same result would be at- 
tained by using only one detector with various clock rates to read out 
SR2, and a return path from SR2 output to input, so that the samples 
for a particular channel could be processed repeatedly. 


In the exploratory key telephone ringing detector the 64 channels 
are sampled at seven times per cycle of the input 20 Hz wave and a 
digital detector is used to examine the samples from one cycle (m = 7). 
The detector stores this tentative result in another serial shift register, 
and when enough 50 msec intervals appear to have ringing present, a 
RING output is delivered to the common controller. This detection 
operation is low Q, but this is by design, and is not dependent on either 
the TSU structure or the technology. An analog detector in this system 
would need m > 7 but would not require the added integration. As 
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always, an appropriate trade-off between selectivity (high Q) and fast 
detection (low Q) must be made. 

The effectiveness of the detector in suppressing frequencies other 
than 20 Hz is a function of the sampling rate, as well as the param- 
eters of the detector. Waveform preservation is not necessary for 
detection, so the sampling theorem requirement (two samples per 
cycle at the highest frequency of interest) need not be met. As few 
samples as possible should be handled to conserve storage, but the 
lower limit is set by the signal duty cycle variation and the size and 
frequency of the interfering signals. The equivalence between pe- 
riodic sampling and modulation permits intelligent selection of the 
sampling rate. 


Ill. EIGHT CHANNEL TSU T'ouch-Tone DETECTOR 


The TSU configuration of Fig. 1 could be adapted to multifrequency 
detection by means of a few additions. First, incorporate at the regis- 
ter SR1 input an analog to digital converter to code the signal sam- 
ples sufficiently accurately to preserve the information content, say 
b bits per sample. Replace shift register SR1 by b parallel shift regis- 
ters, one for each bit at the analog-digital output. Finally, add a 
digital to analog converter at the output of the b parallel read-out 
registers (SR2 in Fig. 1). No change in principle is involved; the 
added circuitry only preserves the signal amplitude through the TSU 
system. A delta coder with a (longer) single shift register could be 
used for the digitizing operation; the type of code is a detail. 

However, the structure of Fig. 1 is not well suited to Touch-Tone 
detection. The serial registers are conveniently available in 64 bit 
and larger sizes. (Smaller sizes would be economically wasteful; adding 
taps increases the lead count perilously.) Only seldom is there a 
need to detect 64 Touch-Tone signals simultaneously, and reliability 
requirements would be excessively difficult, even if the need existed. 
By using the Fig. 2 TSU configuration, the 64 bit registers are used 
very efficiently. 

In Fig. 2, each channel has a private b-register store. The channel 
1(t = 1, 8) inputs are sampled in multiplex by switches S,; and coded 
by a common analog-to-digital converter. The coded samples are 
steered by logic gates S,; to the registers for channel 7. Sometime be- 
tween (or synchronized with) input samples, the registers are read at 
high speed into the digital-to-analog converter, which is assumed to 
be simple enough to build for each channel. Transmission gate Sx 
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Fig. 2— Eight channel TSU Touch-Tone® dialing detector. 


simultaneously connects the (per channel) converter output to the 
detector input bus, so that the detector is time shared by all channel 
circuits. This detector can be a carbon copy of any of the standard 
receivers, but with all reactances scaled up in frequency by the time 
compression ratio. Or, it could be all-digital. In either case, the read- 
out of the channel register bank must be sufficiently fast to permit 
the detector to answer and return to quiescence before the next chan- 
nel is examined. 

An important feature of the Fig. 2 parallel register TSU system is 
that the channel registers need be supplied only for as many channels 
as are actually required. The Fig. 1 serial system must be built en- 
tirely in order to operate at all. 

An 8-channel Touch-Tone receiver using 3 bit (b = 8) coding has 
been built and operated by Mr. R. J. Violet of Bell Telephone Labora- 
tories. In this demonstration system, the channel sampling is done at 
4000 Hz with 64 samples being stored per channel. Each channel is 
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examined every 4 ms, and the detection requires 0.5 ms. Half of this 
0.5 ms is to allow the receiver to become quiescent. The time com- 
pression ratio is the sampling interval divided by the sample read time, 
or 64. The detector is thus constructed for input frequencies at 64 
times the normal Touch-Tone frequencies. This simple demonstration 
system immediately registers the detected results through gates Su 
in a per-channel flip-flop bank (shown in skeletal form). An attractive 
feature of TSU detection is that further processing, such as delay or 
data format conversions, can be made by common equipment. Thus, 
Touch-Tone signal to dial pulse translators for conversion of step-by- 
step switching machines could be very effectively built using TSU 
multiplexing. 

If the input signal can vary considerably in amplitude, either a per- 
channel automatic gain control or more accurate sample coding would 
be required to preserve the signal waveform through coding and decod- 
ing. Also, a sampling rate higher than 4000 Hz and a larger number 
of samples per detection might be used in a production circuit. In 
compensation, a rate of more than 8 channels is within the speed 
capability of the circuitry; additional signal integration to improve 
the tolerance to digit simulation is easy to incorporate. 

The economic advantages of large scale production can be ania 
through the use of 64 bit serial shift registers in many of the digital 
systems. Preliminary economic analysis indicates that the marginal 
cost of one Touch-Tone detector in a TSU multiplex system would 
be less than the equivalent single channel receiver; a cost crossover 
can be expected at about three channels. In comparison with multiplex 
receiving based on digital filtering, TSU offers easier maintenance, 
per-channel modularity, and the ability to incorporate future im- 
provements in the detector circuitry. 
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Data Transmission Error Probabilities 
in the Presence of Low-F requency 
Removal and Noise 


By B. R. SALTZBERG and M. K. SIMON 
(Manuscript received July 19, 1968) 


Upper bounds on error probability are derived for data transmission 
systems which are subjected to gaussian noise and to the removal of the 
low-frequency components of the signal. This error probability can be quite 
low for random data, even though the eye pattern is closed. Both standard 
format and partial response signaling are considered, as are binary and 
multilevel alphabets. Numerical results are given for a high-pass filter 
containing a single pole and for a cascade of several such identical filters. 


I. INTRODUCTION 


It is frequently desirable, or unavoidable, that the low-frequency 
components of a data signal be eliminated. This may occur through 
the use of capacitor or transformer coupling in the terminal equip- 
ment or in the baseband transmission facilities. Another instance 
results from the necessity of removing low-frequency baseband com- 
ponents before modulation in order to provide a spectral guard band 
in the vicinity of the carrier frequency. 

Since de is usually completely attenuated, no linear operation can 
correct for low-frequency removal. One commonly used approach 
uses nonlinear feedback to restore the low-frequency components.’ 
Another solution to this problem involves dc-free signal formats.” 3 

We evaluate the penalty resulting from the removal of low-fre- 
quency components from a standard format data signal (Nyquist I 
shaping) and a partial response signaling format (multilevel exten- 
sion of duobinary with precoding.)* Clearly, in both of these cases, 
the degradation is most severe when the transmitted data sequence 
contains long strings of identical digits. In fact, when the system 
bandwidth is less than the signaling rate, which is usual in data 
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communication systems, the received signal will be zero. This follows 
from the fact that for a periodic input impulse train the lowest fre- 
quency components are at de and the signaling frequency, both of 
which are filtered out. However, the degradation of a random signal 
can be quite small when the cutoff frequency of the offending high 
pass filter is far below the signaling rate. 

We consider binary and multilevel data-transmission systems with 
signaling formats as above, degraded by a single-pole high-pass filter 
or a cascade of such filters. The systems are evaluated for error proba- 
bility in the presence of additive gaussian noise. A previously derived 
error probability bound® is used, which takes the form of a gaussian 
distribution of the signal to noise ratio, in which the larger intersymbol 
interference components subtract from the signal amplitude and the 
smaller ones add to the noise power.® In general, the optimum splitting 
of intersymbol interference terms between signal amplitude and noise 
power cannot be determined analytically. We show that for inter- 
symbol interference components, related by a single exponential damp- 
ing factor, an optimum subdivision can be explicitly specified. Where 
the eye is open, the error probability bound is given directly in terms 
of the eye opening to rms noise ratio. 

We also discuss the refinements of the generalized bound in the 
case of interysmbol interference from a single exponential signal tail, 
and then apply the results to Nyquist I shaped and partial response 
signaling formats respectively. Single poles and a cascade of identical 
poles are considered, and numerical results are given for practical 
data system parameters. 


Ii. DERIVATION OF A SIMPLIFIED ERROR PROBABILITY BOUND FOR SINGLE 
EXPONENTIAL INTERSYMBOL INTERFERENCE 


Reference 5 gives an upper bound for the probability of error in 
the reception of a random digital message perturbed by gaussian 
noise and intersymbol interference. This gives 


[ —-W-1) DA 
P, S A exp ¢— =____**. (1) 


jot + ME > a] 


kyK 





which is subject to 





fo 
doh <q 
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where 


N is the number of levels of the input random message. 
o, is the variance of the additive noise. 


f(@) is the signaling waveform. 


2 is the signaling rate. 


T 
ni f(kT) | for standard format signaling 
.= 
| f((k — 4)T] | for N level partial response signaling with 
precoding* 
and 
aN) ) for standard format signaling 
A= 
2 — 
NB for N level partial response signaling with 


precoding 


We notice that the applicability of the error probability bound to 
partial response signaling formats was not discussed in the original 
paper but is presented here as a further extension of the result.” 
The sets k « K and k ¢ K include all members except k = 0. It is 
also shown in Ref. 5 that 
fe > ‘ (2) 
teK meK 
0 is rearranged in 


Thus, if the signal sample set {f,} excluding k = 
{gn}, fen the sums in 


order of decreasing magnitude to form a set 
equation (1) may be replaced by 


os = 2 Jx 
keK k=1 (3) 


Dh 


kgK k=M+1 


T 
M 
Se 


Tor an arbitrary signaling waveform, f(t), the optimum J/ [in the sense 
of minimizing the right side of equation (1)] must be determined by a 
trial comparison method as decribed in Ref. 5. 


*Jn the partial response case, f; must be replaced by fi—fo in both numerator 
and denominator summations of equation (1) since only the unintentional inter- 
symbol interference should be included there. 
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For an exponential signal tail, 
fe = Thi-1 ; Om tr = a k = 2,3, --- (4) 


Thus, since f(é) is already monotonically decreasing for all t = T, 
the ordered sets {f,} and {g,} are identical in this case. 


Dae) file) ] 
l1—r (5) 
2 jie 
k=M+1 ‘1 L-¢r 


To minimize the right side of equation (1), it is sufficient to maximize 
1 — 
faethe one my 


Q S=S— es 
) 2M 
2| g? sae 
a+ kek | et + fi 1 


Differentiating Q with respect to M gives 


ae oe a 
reac ele | — = Dh) ‘| 3 1 (7) 
/|a+ Se reral 


go = eX (0<21) 


(6) 














where 


and 


We) 
l—r 


fi —2)<fo. | (8) 


Three separate cases must now be examined. 

(4) If fp — (N — Dfi/ — r) < 0, then the eye is closed. From 
equation (7) it follows that dQ/dM < 0 for 0 < « ¥ 1. Therefore 
the positive maximum of Q occurs at the boundary z=1, so the optimum 
value of M is M,,, = 0. 
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(iz) If 
2 —_ a Co 
a,(N — 1) S97, 
EA ae 
3 1+ r/|°° l-—r 

it is implicit that f, — (N — 1)f,/(1 — r) > 0, and the eye is open. In 

this case it is again true that dQ/dM < 0 for0 <2 S1l,andM,,, = 0. 
(iz) If 





o,(N — 1) 2a: 
a es) 
3 1+r/-° (1 — 1) 
it is again implicit that f, — (V — 1)f,/(. — r) > 0, and the eyeis open. 


In this case a positive maximum for Q occurs in the intervalO < # < 1. 
Solving for the point where dQ/dM = 0, we obtain 


port — oN — 1) 
(* = Nf oN Dh fh ) 
3 : 1—r /\l+r 
Notice that condition (8) is automatically satisfied. 
Since the solution for /,,, as given by equation (9) is not necessarily 
integer, the error probability bound as given by equation (1) must be 


modified in terms of the actual choice of an integer I. We will arbitrarily 
use the next higher integer. Letting [1/,,,] denote the next higher integer 


to M,,, and 
= af *)(3 ) 
~ AN + 1/\1 = 7/7? 


equation (6) may be expressed as: 
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where 
[Mont] b One 
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Taiz = (N — 1)f:/(1 —-r) denotes the maximum intersymbol inter- 
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ference and S, = f, denotes the signal amplitude. Combining equations 
(10) and (11), 


‘Ss = [iaulSs go  s) — 


“4 
20 _ S, a i as _ [(S, = 1 Rey + boiz] : 
- 2 Boke (ew 


(S, a“ Tag: 
Since r < b < 1, 


((S, = see) + bor] = (& ay Tux) (5: a ae) 
2Q > oS) = Fa): be 7 en a a + bz> =, + rZ. 


In terms of the error probability, 
P, < A exp [—Q] 


| (Se— Jame)? 4.3 iS Ne +1) 
3 INN UNE 
Ao 5 ; 


For the situations where M,,, = 0 (that is, cases 7 and 72) equation (1) 
becomes 


S? 
P< A ey 1 
<A ew] fei (NAN(o2)r j ve 
aE NN Sty i ee 


III. ERROR PROBABILITY PERFORMANCE WITH A STANDARD FORMAT INPUT 
DATA SIGNAL : te 


(12) 














Figure 1 is a block diagram of the system considered. Although a 
baseband system is shown, a system using linear modulation and de- 
modulation can readily be fit to this model. P(w) is the basic shaping 
filter and it is assumed that the receiver is matched to this shaping 
filter. For simplicity, P(w) is chosen to be real. The added noise is 
white gaussian. H(w) is the narrow high-pass causal filter whose effects 
are considered. Since H(w) is narrow, it makes little difference whether 
the noise is added ahead of, behind, or somewhere in the middle of 
this filter. 

The source generates symbols randomly from an N-ary alphabet 
at a rate of 1/T symbols per second. The transmitted signal may be 
represented by . 


i) 


s(t) = >) ap(t — kT) 


m= CO 


where the a;,’s are independent, zero-mean random variables which take 
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one of N equally spaced values with equal probability, and p(é) is 
the impulse response of P(w). 

It is assumed that there is no distortion other than H(w) and that 
Q(w) = P?(w) is a Nyquist shaped filter of bandwidth less than 1/7, 
so that 


qikT) =0, all k #0. (14) 
If we let P(O) = 1, then 
1 
(0) = 3- | Q) dw = T/P. (15) 
The power of the transmitted signal is 
_ (Ai)av / 2 age 
S = on P*@) dw = T? (16) 


where o° is the variance of a, . 
The signal presented to the sampler may be written in the form 


oe 


ri) = Do alate — RP) + et — kPY] + 0) 


where e(t) is the error signal caused by the low frequency removal, 
H(w). From equations (14) and (15), 


r(mT) = an| 2 -- «0)| -- 2 a,el(m — k)T| + n(mT). (17) 


The effect of the low frequency removal is both the reduction of the 
signal amplitude [since e(0) is negative] and, more important, the in- 
troduction of intersymbol interference. 

The Fourier transform of the error signal is 


Eo) = Q@)[H(@) — 1] (18) 
so that 


he / ; ath aay ae 


where hA_,(é) is the inverse Fourier transform of [H(w) — 1]. 

In all cases of interest, H(w) — 1 is much narrower than Q(w). The 
time function h_,(é) therefore is virtually constant over a time interval 
equal to the effective duration of q(t). We may therefore approximate 
q(t) by a delta function, whose area is unity since Q(0) = 1. 


sii [ ; Sache Gas. 
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Fig. 1— System block diagram. 


If H(w) is causal (which is the case we are interested in), then 


0 t<0 
et) = \2h.,(0°) t= 0 (19) 
(ee i> 0 


where e(é) is the negative of the impulse response of a narrow causal 
low-pass filter. The generalized bound given in equation (1) can be 
applied to this case as: 


1 2 
vy | [pteo-o-9 Blalll 7 
Se eee re eg ee (20) 
alot + 7 Da] 


k¢K 


P, 





The quantity o° is the noise power at the sampler input and is also 
equal to the noise power at the receiver input, measured in a bandwidth 
equal to half the signaling rate. For the N-level system, 


fia Nod, 
; 3 


In terms of the signal power, equation (16), equation (20) may be 
rewritten as 





(1+ 90)-(N-) Dlg IP 
P, < aN exp = ; keK (21) 
| 201% + Dal 


where g(t) is the normalized error signal 
g(t) = Tel | a 
Giw) = T[H() — 1). 


To apply the simplified bounds derived in equations (12) and (13), we 
must first specify the high pass filter, H(w). 
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3.1 Single Pole Filter 


A very common type of low frequency removal results from the 
use of a single capacitor or transformer. The transfer function is 


H(s) = 





a 


where 7 is the time constant of the low frequency removal circuit. Its 
corner frequency is then 1/(27r). From equation (22), the normalized 
error signal is 


T 
sr +1 





G(s) = — (23) 


and 
ot) = — exp (up 


where u(é) is the unit step function. Introducing the normalized quantity 





a=4, (24) 
then 
0 k<0 
g(kT) = ae k = 0. (25) 
—a exp (—ka), k>0 
Letting 
Tfo = 1+ gO) 
Th = |g |, k=1,2,--- (26) 
and 
r=e", 
the normalized eye opening becomes 
Tf, — BE: =1-$- oe te res (27) 


Thus, /,,. = 0, and equation (13) becomes 
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p 2 2W =D | (1 - §) | 


ex SUE Gil ana 28 
N al 2et| ats a’ || ( ) 
AS exp (2a) — 1 
When a < 1, we may approximate equation (28) by 


ip as, ener | (29) 
- (2 a) 
2aa\"g + 5 


The error bounds for binary, 4-level and 8-level systems are plotted 
in Figs. 2, 3, and 4, respectively, as a function of the signal to noise 
ratio, S/o? , and the normalized reciprocal time constant, a. The dashed 
curves are the exact values for no low-frequency removal. 


_ 2N = 1) i 2 2(N 2) cp AS) 
(that in eS N- erfe Tele V erfe oe 
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where erfe (x) = au ei” it): 


It is seen that, in the region of 10° error probability, these exact 
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Fig. 2—Upper bound of the error probability of a binary standard format 
system with a single-pole high-pass filter. 
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Fig. 3— Upper bound of the error probability of a 4-level standard format 
system with a single-pole high-pass filter. 
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Fig. 4— Upper bound of the error probability of an 8-level standard format 
system with a single-pole nigh-pass filter. 
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curves differ from the corresponding bounds by approximately a 
factor of 10 in error probability, or 1 decibel in signal to noise ratio. 
In the binary case, it is seen that a simple high-pass filter with a 
time constant of 50 bit intervals introduces a degradation of only 
about 1 decibel in the region of 10- error probability. On the other 
hand, a time constant of 10 bit intervals leads to totally unacceptable 
performance. For the same amount of degradation and the same 
symbol rate, the 4- and 8-level systems must have high-pass time 
constants respectively 5 and 21 times that of the binary system. 


3.2 Cascaded Single Pole Filters 

In many cases, several single-pole high-pass filters are contained 
in the transmission path of the system. If n identical networks are 
used, then the overall high-pass transfer function is 


ee 

H,(s) = (= af i) j (80) 
In many cases, a transfer function containing a large number of real 
poles of different values can be approximated by a transfer function 


of the form of equation (80) .° 
The Laplace transform of the error signal is 


ao = te) ~ 1} 


To find g,(t), we first evaluate 
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At the sampling times, 








0, m <0 





—a exp (— ma) > @ i :) mo : m> 0 
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where again a = T'/r. 


LOW-FREQUENCY REMOVAL 267 


This function may also be expressed in terms of the generalized 
Laguerre polynomial,’ 


1@) = 2 > (0) oe 


(32) 
g.AmT) = = exp (— ma)LS-? (ma), m> 0. 


It has been found empirically in several numerical examples that 
the best error probability bound was obtained when all intersymbol 
interference terms were added to the noise (that is, Mot = 0). The 
resultant bound is therefore 


2-1), jo 
p 2 
. | 2at| % re (2 exp (— may” (ma) | 


An example of practical interest is the evaluation of the perform- 
ance of a baseband binary 50,000 bits per second data set without 
de restoration, operating over a transmission facility using trans- 
former coupled repeaters. The transformers each have a corner fre- 
quency of 15 Hz, and therefore a time constant of 


P, - (88) 


1 


T= Qn X 15 = 10.6 msec. 
so that 
210 
a= 0.0106. > 0.00188. 


The results of Fig. 2 indicates a degradation of only about 0.1 
decibel when a single transformer is introduced. However, several 
transformers are usually present in actual systems. The error signals, 
gn(t), and error probability bounds have been computed for both 14 
and 28 transformers. The error signals for these two cases are shown 
in Fig. 5. Remember that one millisecond is equal to 50 bit intervals. 


Figure 6 shows the error probability bounds for these situations; 
28 transformers lead to unacceptable performance while 14 transformers 
introduce a degradation of 3 decibels at 10~° error rate. It is significant 
that n transformers produce more degradation than a single transformer 
with a corner frequency 7 times greater. Also, under the assumptions 
of this paper, all of the above results apply independently of the roll-off 
characteristic of Q(w), as long as it is a member of the Nyquist I class. 
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Fig. 5— Errors signal for a cascade of transformers with 15 Hz corner fre- 
quencies. 
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Fig. 6— Upper bound of the error probability of a 50,000 bit/sec binary sys- 
tem with a cascade of-transformers with 15 Hz corner frequencies.  _—_ a 


LOW-FREQUENCY REMOVAL 269 


IV. ERROR PROBABILITY PERFORMANCE WITH AN N-LEVEL EXTENSION OF 
A DUOBINARY INPUT DATA SIGNAL 


The system model considered here is identical to Fig. 1 except 
that (7) a precoder which converts the input N-level sequence {ax} 
to another N-level sequence {b;,} according to the relation 


by = [@n-1 — ba-1](mod NV) (34) 


is inserted between the source and the transmitting filter, P(w), and 
(tt) a decoder follows the sampler which decodes the received levels 
modulo N to recover the original symbols a, . The important point for 
our application is that by including precoding at the transmitter, no 
knowledge of any symbol or sample other than the received sample, 
’, » iS involved in deciding a, . 

[Instead of the Nyquist shaping characteristic, the cosine filter is 


used for the composite signal shaping characteristic, Q(w) = P’(w), 
that is, 
Q@) = cos 5 = lw|s 7 (35) 


The system impulse response is given by 


_ 2 | _cosat/T | 
A) = oR F — Ae | 


so its values at the sampling instant are 


a k=0,1 
ql& — 3)T] = (36) 
0 k#0,1. 


The power of the transmitted ae is 


2 
20% 


g = Wide - “Ql de = oe (37) 


2rT 
where oj is the variance of b, . If the input symbols a, are equally likely 
and independent, then so are the precoded symbols b, . Thus, of = o? . 
The sampler input waveform, r(¢), may be expressed as 


ive) 


ri) = DF blot — bP) + e(t — TY] + nf) (38) 


where once again e(t) is the degradation caused by the low frequency 
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removal, H(w). Substituting equation (36) in equation (38), 
rim — 3)T] =r, = (6, + 6 ee b (—7) 
2 m m m=-1 oT m 9 
+ do bel(m — k — 2)T] + n[(m —4)T]. (89) 
kA~Am 
If H(w) is causal as before, then e(— 7'/2) will be zero. Making the same 


assumptions as in the standard signal format case, we arrive at an ex- 
pression for error probability analogous to equation (21) 


Seg t-W-) Dlall 
P, od (454) exp kek 
N 9 jee se 4 
aS kek de 


Here we consider only the single pole high-pass filter for H(w). The re- 
sult for a cascade of n identical poles follows immediately. 





(40) 





4.1 Single Pole Filter 
We start by examining the normalized eye. Letting 
Tf, = 1/2 
Tie Gel hee 12) 


(41) 


and r = e 
(N — Df, 1 (N — l)ae~* 


fo, 5 Tee (42) 


Thus, 1/7... = 0 and equation (13) becomes fora < 1 


2 

Poe o(X ++) exp yas a (43) 

eRe 
“\rS z, 

Figures 7, 8, and 9 illustrate the behavior of the error probability 
bounds versus S/o; for binary, 4-level and 8-level partial response 
signals with the normalized reciprocal time constant, a, as a parameter. 
The dotted curves give the exact values of P, for the case a = 0 


‘that By P= of - 1) erfe ((Sx/8)" | 


N? TaTn 
We once again observe that in the neighborhood of 10~° error proba- 
bility, the exact curves for a = 0 differ from the corresponding bounds 


by approximately a factor of 10 in error probability, or 1 decibel in 
signal to noise ratio. 
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SIGNAL /NOISE IN DECIBELS 


Fig. 7— Upper bound of the error probability of a binary partial response sys- 
tem with a single-pole high-pass filter. 











4 
SIGNAL /NOISE IN DECIBELS 


Fig. 8— Upper bound of the error probability. of a 4-level partial response 
system with a single-pole high-pass filter. 
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28 
SIGNAL /NOISE IN DECIBELS 


Fig. 9— Upper bound of the error probability of an 8-level partial response 
system with a single-pole high-pass filter. 


However, to achieve a S/N degradation of only 1 decibel in the 
region of 10° error probability with a simple high-pass filter, the 
time constant must be about four times that needed for the standard 
format signal. The above statement is true for the binary, 4-level, 
and 8-level cases. This more stringent requirement on the location 
of the low frequency cutoff may be viewed as a tradeoff for the saving 
in bandwidth associated with partial response signaling. 


V. CONCLUSIONS 


Although a high-pass filter will always close the eye pattern of 
? a standard format data signal (Nyquist I shaping) or 7 a multi- 
level partial response signal (duobinary format), the error probability 
may still be quite low for random data provided that the high-pass 
filter is sufficiently narrow. This effect permits the use of capacitor or 
transformer coupling in the data terminals or transmission facilities. 
Multilevel systems require a longer time constant for these networks 
than do binary systems for the same performance. 

Upper bounds of error probability have been given for binary, 
4-level, and 8-level systems with gaussian noise and a single-pole 
high-pass filter (exponential time response). A binary system with 
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a standard format input signal is degraded by only about 1 decibel 
by a simple high-pass filter whose time constant is 50 bit intervals. 
Four-level and 8-level systems require time constants of 250 and 
1000 baud intervals, respectively, for the same performance. 

A data system whose input is a binary, 4-level, or 8-level partial 
response signal must have a low frequency cutoff which is two octaves 
lower in order to achieve the same performance as a standard format 
system. 

The error signal for a multiple-order pole is an exponential multi- 
plied by a generalized Laguerre polynomial. The performance of a 
system with an nth order pole high-pass filter is worse than one with 
a single pole n times as large. 
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Computer-Aided Circuit Design 
by Singular Imbedding 


By E. B. KOZEMCHAK and M. A. MURRAY-LASSO 
(Manuscript received August 5, 1968) 


We give a new and powerful method for the direct solution of circutt 
design problems. The method begins with a prespecified topology and some 
or all elements undetermined in value. The designer imposes on the circuit 
any desired set of node-pair voltages, branch currents, or driving point 
and transfer immittances. Values of circuit elements that satisfy the con- 
straints are directly calculated. This direct method of solution avoids the 
usual iterative analysis-optimization schemes, reducing computer times by 
up to three orders of magnitude. 

A linear set of design equations is formulated by choosing undetermined 
element currents and node voltages as the variables. Singular elements are 
introduced to impose the desired constraints. Inequality as well as equality 
constraints are permitted. Element values are determined from the solution 
of these equations. In this paper we emphasize our method of solution in 
relation to de networks. 


I. INTRODUCTION 


The most significant advances made in computer-aided circuit de- 
sign have been in analysis programs. The designer can now choose 
from among several general purpose programs that program which 
most nearly suits his particular needs. In designing a circuit to meet 
a given set of requirements, the usual approach has been to use 
analysis programs in some optimization scheme. Through an iterative 
process, carried out by the machine, the man, or a man-machine inter- 
action, a final design is reached. The approach presented here pro- 
vides a direct solution, and does not rely on such iterative schemes. 

The method is most fertile in the area of active network design, 
where one often wishes to choose element values in a specified topology 
in order to meet some set of requirements. The method has been 
applied to a number of design problems of current interest including 
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biasing direct coupled transistor circuits; designing transistor ampli- 
fiers for specified midband gain, input, and output impedances; and 
simultaneously realizing several specified impedance or admittance 
parameters of a network. 

In the design of electronic circuitry, one usually wishes to imbed 
passive elements into a network containing active devices, and to 
determine the required passive element values. Therefore, this paper 
deals with the determination of element values in a _ prespecified 
topology for which a given performance is required. Two new ele- 
ments, a voltage forcing element (VFE) and current forcing element 
(CFE), are introduced in order to constrain network voltages and 
currents. These elements may be realized with independent voltage 
and current sources, and the nullator, a somewhat “pathological” 
element used in theoretical network studies. 

The method of singular imbedding places the VFE’s and CFE’s 
in a network to constrain the desired variables. The terminal voltage- 
current behavior of the variable elements is not specified. Instead, 
the constraints imposed upon the network by the VFE’s and CFE’s 
are used to determine allowed voltage-current relations for the variable 
elements. The formulation remains linear in these variables. The last 
step involves determining the element values through Ohm’s law 
once the allowed voltage-current relations are known. 

By appending the original set of equations with a set of inequality 
constraints, it is possible to restrict the range of element values in 
the solution. For example, realizations employing only element values 
between specified lower and upper bounds are possible. For simplicity, 
only the case of linear de networks are illustrated. Extensions of 
the method to ac and nonlinear design are considered elsewhere. 


II. A NEW APPROACH 


To understand the philosophy of this new approach to design, con- 
sider the train of events in realizing a set of requirements with elec- 
tronic circuitry. Since the choice of topology is better handled by the 
man than the computer, we will assume some specified topology in 
which some or all of the element values are to be chosen to meet the 
given criteria. For example, in designing transistor circuitry it is nec- 
essary to choose some resistance values to properly bias the transis- 
tors. Similarly, one must often choose element values to give a desired 
voltage gain, driving point impedance, transfer impedance, or similar 
network function. 
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The invariant feature in all of these problems is that a set of net- 
work currents and voltages, or their ratios, has been constrained. The 
design problem is to find any set of element values consistent with 
these constraints. If the problem is posed with sufficient freedom, 
many sets of element values may exist consistent with the imposed 
constraints. Conversely, if the problem is posed with insufficient free- 
dom, inconsistent equations arise and there is no solution. 

If one can find a general method of imposing these network con- 
straints, and can simultaneously monitor the voltage-current relations 
these constraints force at the terminals of the variable elements, then 
indeed a direct solution to many computer-aided design problems will 
have been found. 

Before proceeding, however, consider a very simple example of 
how one might presently handle the design problem and the diffi- 
culties that would ensue. Suppose in the network of Fig. 1, one wishes 
to choose G; and G». such that V’ is constrained to be 0.1 volt. A set 
of nodal equations may be written: 


GQ —-G | " _ a) 
—G, G, + Ge Vs 0 

The first step involves a transformation of coordinates so that 
the desired quantities appear explicitly in the equations. In general, 


this will necessitate using hybrid parameters. For this case, the fol- 
lowing transformation might be used: 


| _ ; 7 a 0) 
V3 0 LY; 


Inverting the relation, we have 


m = , i“ (3) 
Ve 0 vi 





‘Fig. 1— Simple design problem.” 
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and the current-law equations become 


t 
i +G, -G ik i : _ 1 (4) 
=G, G, + G, 0 1 Vv; 0 


ale 
-G, GJVi} 0 

Substituting the constant V’ = V', = 0.1, the set of equations becomes 

O11 +G) + V2, =1 

—0.1G, + GV2 = 0. 


Thus, even if one is successful in finding a transformation to a 
basis that includes the variables that are constrained, the result is 
usually a set of nonlinear equations in the network elements and 
voltage variables. Solving this set of nonlinear equations for the 
unknown voltages and element values is extremely difficult. A method 
of handling this difficulty has been suggested, involving the use of 
optimizing techniques to vary element values until the network vari- 
ables take on their desired values—in this case V’ = 0.1 volt.t While 
this is a useful approach, it has several disadvantages. First, it is 
time consuming since many iterations are required for convergence. 
Second, local minima, or lack of sufficient numerical accuracy, may 
prevent convergence to a correct solution. Finally, although an infinity 
of sets (G1, G2) exist to satisfy the given constraints, the optimization 
yields only one of these sets. 

With these difficulties in mind, let us repeat the philosophy of 
design presented here. We first determine how the requirements con- 
strain network currents and voltages. We then force these currents 
and voltages to take on the desired values. Finally, we determine the 
effect of such constraints upon the voltage-current relations at the 
terminals of variable elements. These v — 72 relations then determine 
the values of the variable elements. 


(6) 


III. NETWORK CONSTRAINTS 


The common feature of all network synthesis problems is that they 
require some specified relation between some voltages and currents in 
the network. For example, synthesis of a given driving point im- 
pedance constrains the ratio of a port voltage to the current at that 
port. Synthesis of a transfer impedance constrains the ratio of a port 
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voltage to the current at a different port. A specified voltage or cur- 
rent gain constrains the ratio of two port voltages or port currents, 
respectively. Indeed the synthesis of entire network matrices is a 
combination of such constraints. Similarly, the static design problem 
in electronic circuits involves fixing certain branch currents and 
branch voltages. For example, one usually wishes to bias a transistor 
for a given collector current and collector-emitter voltage. Resistance 
values are chosen consistent with these constraints. 

It is essential to demonstrate a method for constraining voltages 
and currents in a network. The required constraints are shown in 
Fig. 2. We introduce two new elements, a current forcing element, 
CFE(I,), and a voltage forcing element, VFE(V,), which will be real- 
ized with more conventional elements shortly. We want the CFE(J,) 
to be such that it constrains the current through branch j to be J,, 
without otherwise affecting the behavior of the network. We want the 
VFE(V,) to be such that it constrains the voltage across branch 7 
to be V, without otherwise affecting the behavior of the network. 


In discussing the properties of the CFE and VFH, we use the concept 
of admissible or allowed pairs of voltage and current variables (v, 7).” 
The set of voltage-current pairs that a system N allows can be used to 
completely describe that system.’ For example, let the system under 
consideration, Nz, consist of a single resistor of value R. Then the 
system is completely described by its allowed terminal voltage and cur- 
rent pairs; namely, (R72, 7) « Nz. Similarly, a capacitance of value C, 
denoted NV , is completely described by its allowed pairs (v, d(Cv) /dt) « 
Ne. 

We now define the CFE(I,) and VFE(V,) in terms of their allowed 
pairs. 

Current forcing element (J,): 


(0, I,) & Neve) : (7) 


Here we postulate an element which allows no voltage drop across 
its terminals, and passes only a specified current J,. 

Next, we postulate an element which allows only a fixed voltage 
VY, to exist at its terminals, and passes no current. 


Voltage forcing element (V,): 
(Vv. ? 0) € Nyrewv.) . (8) 


Figure 2 makes clear the use of these elements in constraining net- 
work variables. In Fig. 2a, the current in branch 7 is forced to be 
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BRANCH J 
Vo VFE (Vo) 





(a) (b) 


Fig. 2— Network constraints. (a) Branch J current constrained by current 

forcing element (CFE); (b) Branch J voltage constrained by voltage forcing 
element (VFE). 
I, by inserting a CFE in series. Since the CFE(J,) allows no voltage 
to exist across its terminals, its presence affects Kirchhoff’s current 
and voltage laws only to the extent that branch 7 current is con- 
strained to be J,. Notice that this would not be the case had we in- 
serted a current source in series with branch j. The current source 
would allow some voltage to exist between its terminals which would 
have been included in Kirchhoff’s voltage law equations. Thus, a 
current source of value J, would not only constrain branch j current 
to be I,, but would also introduce a new degree of freedom, namely, 
the voltage across the current source. 

Similar reasoning can be applied to Fig. 2b. Here a VFE(V,) is 
applied across branch 7 to constrain that voltage to be V,. Since the 
VFE(V,) passes no current, Kirchhoff’s laws are affected only to the 
extent that branch j voltage is now constrained to be V,. The net- 
work cannot respond with a new degree of freedom, as it could if a 
voltage source were placed across branch 7 and thus allowed to in- 
troduce a new current variable in Kirchhoff’s current law equations. 
It should be noted that the VFE(V,), can be placed between any 
two nodes to constrain the voltage between those nodes; it need not 
be placed across a branch. 

By using current sources and voltage sources in conjunction with 
VFE’s and CFE’s, current-voltage ratios may be constrained. For 
example, in Fig. 3a, 


Tits Vie A 
io pp (9) 
In Fig. 8b, 
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=ZI 


(a) 


Fig. 3— Methods of constraining current-voltage ratios. (a) Impedance forcing 
element [IFE(Z)]; (b) admittance forcing element [AFE(Y)]. 





Thus we are constraining the network N to have, in the first case, 
a driving point impedance Z, and in the second case, a driving point 
admittance Y. The configurations used to constrain impedances or 
admittances will be denoted impedance forcing elements, IFE(Z), 
and admittance forcing elements, AFE(Y). Notice that IFE’s and 
AFE’s are composed of CFE’s, VFE’s, and independent sources. They 
are useful in constraining a network to have a desired driving point 
impedance or admittance. 

We already mentioned that VFE’s and CFE’s could be realized in 
terms of existing elements. The necessary elements are the ideal cur- 
rent source, the ideal voltage source, and the nullator, a somewhat 
“pathological” network element introduced by Tellegen.* Returning to 
the allowed pair concept, the nullator is defined to be a two-terminal 
element for which the only allowed voltage-current pair is (0, 0). It 
can be looked upon as a simultaneous open and short circuit, since it 
allows only zero voltage at its terminals and passes no current. 

From its definition, one could not hope to physically realize and 
isolate such a device. However it’s characteristics may be observed at 
the input to an operational amplifier imbedded in a feedback net- 
work, where the input is at a virtual ground (short circuit) and yet 
passes no current (open circuit). The nullator is represented sche- 
matically in Fig. 4. 

By appropriate connections of voltage sources, current sources, 
and nullators, the VFE’s and CFE’s may be realized as in Fig. 5. 
Remembering that the nullator passes zero current and has zero volt- 
age across its terminals, the equivalents of Fig. 5 becomes clear. In 


ogee a 


Fig. 4— Schematic representation of nullator. 
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= [0]. 





(a) 
Fig. 5 — Equivalent circuits for VFE and CFE using nullators. 


Fig. 5a, the terminal voltage must be V,, and since no current exists 
in the element the combination voltage source and nullator is by 
definition a VFE(V,). In Fig. 5b, a current J, exists at the terminals 
but no voltage drop exists across the terminals. Thus by definition, 
the combination current source and nullator is a CFE(J,). 


IV. ADDING FREEDOM TO THE NETWORK 


In the previous section, we placed constraints on the network that 
would generally lead to a set of inconsistent equations if all the 
elements were also specified. However, if some network elements are 
variable, we can determine how the constraints affect the voltage- 
current relations at the variable element terminals, and then choose 
variable elements in such a way as to be consistent with these v — 
2 relations. 

We propose two methods of characterizing the variable elements. 
First, since the element is variable, we can ascribe no functional rela- 
tion between the voltage and current of that branch. This is handled 
in writing the nodal equations for the network by explicitly adding 
the currents through variable elements into the equations, rather than 
first transforming them into voltage variables through a functional 
relation of the form 


ty = Yuu, (11) 
where the b implies the variable refers to some branch. The nodal 
equations are of the form 

[Y;]V] = Is] + [CI], (12) 
where 


V] is an n-vector of node voltages. 
I,] is an n-vector of forcing currents at each node. 
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[Y;] is the n X n nodal admittance matrix of the fixed portion of the 
network. 
I] is an r-vector of unknown currents through variable elements (r 
is the number of variable elements). 
[(C] isthen X rnode cutset matrix for the graph of variable elements. 
I] and V] are both vectors of network variables, and may be combined 
by matrix partitioning as 


[—C] ¥dJ BE =m ep (13) 


Equation (13) describes a network in which some element values 
can be chosen to meet the given constraints. In the remainder of this 
paper, we combine the added degrees of freedom given by the variable 
elements in equation (13) with the constraints imposed by the CFE’s 
and VFE’s. All networks, satisfying the VFE and CFE constraints 
and the specified topology, with be generated. 

A simple example will help clarify these concepts. Figure 6 is the 
network of Fig. 1, with the 1-ohm resistor replaced by a known resis- 
tance of R ohms. Currents J, and I, are those carried by the variable 
conductances G', and Ge, respectively. The set of nodal equations is 


ce al = 1 a ' ‘1 at (14) 
0 0 Ve 0 1 —l I, 
Rearranging into the form of equation (18), 


q, 


1 
eer aa (15) 
—-1 1:0 O/Y,z 0 
Vo 


From this example, the method of generating equation (13) should 
become clear. 


ed) {ts 


Fig. 6—Simple design problem. 
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The second approach useful in dealing with variable elements in a 
network is the introduction of another pathological element, the nora- 
tor, shown in Fig. 7, also introduced by Tellegen.t The norator is a 
two-terminal element with allowed pairs (v, 7), with v and 2 independ- 
ent and arbitrary. Thus, any voltage and current may appear across 
its terminals simultaneously, which is the property that we desire of 
variable elements. We do not wish to force any functional relation 
between the voltage across and the current through variable ele- 
ments. We wish only to observe constraints that may be imposed on 
the v — 7 relations by the VFE’s and CFE’s. The norator allows the 
network the extra degree of freedom taken away by the introduction 
of nullators. 


V. FORMULATION OF NETWORK EQUATIONS 


Since the introduction of nullators and norators into a network will 
generally introduce singularities into the corresponding equations, we 
call the approach we are considering the method of singular im- 
bedding. It has been demonstrated that the design problem can be 
reduced to the appropriate imbedding of nullators, norators, and in- 
dependent voltage and current sources. Let us now examine the effect 
of such imbedding on the network equilibrium equations. Since a 
nodal admittance formulation is used, it is important to determine 
the effect of nullators and norators on the admittance matrix. 

Independent voltage sources may be conveniently incorporated into 
an admittance formulation. If a series impedance exists with the 
voltage source, application of Norton’s Theorem is sufficient. If no 
series impedance exists, the introduction of positive and negative im- 
pedances is necessary in transforming the voltage source to an inde- 
pendent current source (see Fig. 8). 

The effect of nullators and norators upon the admittance matrix 
of a network has been considered by A. C. Davies.' Let us write the 
nodal equations for the network with all poner and norators re- 
moved. The equations are of the form 


[Y.IV] = 15] (16) 


Tig. 7— Schematic representation of norator. 
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Fig. 8 — Equivalent circuit for ideal voltage source. 


where 


[Y,] is the admittance matrix of the network with nullators and 

norators removed 

V] is the vector of node voltages with respect to ground 

I,] is the vector of currents injected into each node. 

Suppose now that a nullator is connected between nodes 7 and j. 
Since the nullator passes only zero current, the current law equations 
at those nodes are not affected. However, since there is zero voltage 
across the nullator, V; and V; are now constrained to be equal. Call 
this new value V;;. Clearly, one degree of freedom has been removed 
from the network response. In addition to the matrix equation (16), 
one equation of the form 


V;= V; (17) 
is added for each nullator imbedded in the network. Thus, if k nulla- 
tors are imbedded, k additional constraint equations are added. 

Two viewpoints can be taken here. First, the original set of equa- 
tions, equation (16), has been appended by a set of the form 
[B]V] = 0] (18) 
where 


V] is the n-vector of node voltages. 
[B] isa k X n matrix of —1, 0, 1 entries expressing the set of con- 
_ straints of equation (17) for the k nullators. 


The final set of equations becomes 


E}n-"} we 


ake second approach to the- problem was suggested -by Davies.° In. 
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the nodal equations below 
V, | 


Yun ott Yai ttt Yas tha ly 


\ Yor Yo: Yo ; : =z Ts] (20) 
. V, 


Al 
the addition of a nullator between nodes 2 and j makes V; = V; = Vij. 
The zth and jth column of the Y matrix are both multiplied by Vi;, 
thus they may be added and the equations written as 








LYn1 eee Yni eee Ynj FY Uae 





V, | 


Ye? (Yui bY) oot Yin 


Yai (Yo: + Yoi) Yon V.. = Is]. (21) 








Yai (Yni ae Yni) siete Yan y 


The addition of k independent nullators (no nullator loops) causes k 
additions of columns of Y and reduces the dimension of V] by k. We 
denote the reduced set of equations by 


LY lax in—e VV en-eyx1 = Ts|nxa- (22) 
In either interpretation, we observe that the resulting set of equa- 
tions is no longer square. In the first interpretation, we are increasing 
the dimensionality of the vector space that the column vectors must 
span, without adding new basis vectors to span that space. In general, 
the equations will be inconsistent. In the second interpretation, we are 
keeping the dimension of the space fixed, but reducing the number of 
vectors available to form a basis and the space may no longer be 
spanned. Again inconsistencies will generally arise. In either interpre- 
tation, the inconsistencies are to be expected since nullators (VFE’s 
or CFE’s) have been introduced to constrain network variables. 
Let us now examine the way in which variable elements (additional 
degrees of freedom) remove these inconsistencies. Again two points of 
view may be taken. One provides us with new basis vectors to span 
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the space of possible injected current vectors Is], the second reduces 
the dimensionality of the space of Is] in order that the existing number 
of basis vectors might again span the space. 

Section III gives the essence of the first interpretation with the 
important result, equation (13). Observe that imbedding variable 
elements in a network provides an additional set of column vectors, 
namely, those of [—C], that may be used as basis vectors in spanning 
the space of possible Is]. Thus, if one has complete freedom in selecting 
variable elements, a set of column vectors, the columns of [—C] can 
always be found to assure that the space of all possible Is] will be 
spanned, regardless of how the nullators reduce the space of the column 
vector of the Y matrix. This concept, which involves growing new 
elements to satisfy imposed constraints, will be the subject of future 
study. 

A second approach in handling the freedom introduced by variable 
elements is to replace each variable element by a norator, as suggested 
in Section III. The method of Davies may then be employed to analyze 
the network containing norators.° Again assume that the admittance 
matrix Y, of the network without nullators and norators is available. 
Thus, 


[Y.]V] = Is]. (23) 


Now suppose that a norator is connected between nodes h and k, and 
that the reference direction for the arbitrary norator current I, is 
from h to k. The current-law equations for nodes h and k will be of 
the form 


Ish — ie 


I 


pe Yi: (24) 


J + is = Dy Yous. (25) 


Since J, is arbitrary, and is not needed to solve for the node volt- 
ages, adding the two equations gives 


Is, + Is. = > (Yui + Vousdo:. (26) 


This corresponds to the addition of rows h and k of the nodal equa- 
tions of the network without norators. Thus for a network containing 
n nodes and r norators, only n — 1 — r independent equations can be 
written. 

Observe in Fig. 9 that the effect of connecting the norator between 
nodes h and k is to replace the nodal equations for nodes h and k 
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Fig. 9 — Effect of connecting norator between two nodes. 


by a single current law equation for the ambit (broken line) sur- 
rounding both nodes h and k. Thus any functional relation between 
the current and voltage of branch j is removed, as is desired for a 
variable element. 

To summarize thus far, the following manipulations may be per- 
formed on the network current law equations to deal with VFE’s, CFE’s 
and variable elements. To include network constraints, first imbed the 
CFE’s and VFE’s. Write the Y matrix with nullators removed. Then 
reduce the matrix by adding appropriate columns. This may be stated 
compactly by a matrix transformation as® 


Is] = [Y.][U.]V] (27) 


where [U,] is a matrix obtained from the unit matrix by adding columns 
corresponding to nodes between which nullators are connected. Since 
the transformation [U,] is singular, not all components of V] are deter- 
mined. The undetermined ones are found from the relation 


[B]V] = 0]. (28) 


To include variable elements, either 
(c) Augment the Y matrix of the fixed portion of the network with 
the node cutset matrix of the graph of the variable elements to get 


[-clyg|=1) (29) 


or 


(272) Add the current law equation ‘corresponding to nodes to which 
a nullator is connected. This is compactly stated by a matrix transforma- 
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tion as° 
[U,]I;] = [U,][Y.]V] (30) 


where [U,] is a matrix obtained from the unit matrix by adding rows 
corresponding to nodes between which norators are connected. The 
vector of currents through variable resistors is then formed by equation 
(29). 


VI. SOLVING THE NETWORK EQUATIONS 


We now wish to solve the set of equations after imbedding CFE’s, 
VFE’s, and variable elements. We assume equation (29) to be our 
starting point. A similar formulation may be made using equation 
(30) as the starting point. CFE’s and VFE’s are imbedded, variable 
elements are specified, and nullators are removed to generate the set 
of equations 


I 
[36.).¥,) 2| = Ig]. (31) 
Addition of nullators to the network adds the set of equations 
BY] = 0] (32) 


and, from equation (27), the corresponding transformation [U,] on 
the admittance matrix. Thus the final set of equations becomes 





0 


! 

0: B 

As seen in the previous section, the transformation [U,] (which adds 

columns of Y;) is consistent with the set of equations [B]V] = 0. Thus 

the second matrix equation in equation (33) will always have a solu- 
tion, provided the first one does. It remains only to solve 


[—Ci Y,U] z] = Ig]. (34) 


—C Y;U, Is 
il 4 


in order to determine the proper element values. Let 


q - x 
V (r+n—k)X1 ; 


By using the Gauss-Jordan method one can bring these equations 


into the form 
Ui Q| %| _ 1 
oI 0.) X%} ~ Ivo (35) 
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where 


X,|X.] is a vector of node voltages and currents through variable 
resistors, 
[U] is the unit matrix, 
[Q], Isi], Is2] are the resulting submatrices after transformation. 
If Is.] = 0 (the equations are consistent), the first equation can be 
solved for X,] in terms of X,]. 


x, = Is,] a [Q]X.]. (36) 


The case Is.] + 0 implies that there are no values of variable elements 
consistent with the imposed constraints. For Is.] = 0], equation (86) 
generates all solutions to the problem. Some network variables X,] 
can be chosen arbitrarily and the remaining variables X,] determined. 
At each setting of X,] the variable elements can be determined since 
all node voltages and currents through variable elements are known. 


Thus 


Zz, = a5 te for “a@=1,r (37) 
where 71 and 72 are connection nodes of the zth variable element. By 
allowing the free variables X,] to take on a continuum of values, all 
solutions to the problem are determined directly. 

Returning to the example already discussed (Fig. b), let us apply 
the method of singular imbedding. The circuit is redrawn in Fig. 10 
with the introduction of a VFE to constrain the voltage between nodes 
1 and 2 to be 0.1* volt. With the nullator removed, a set of nodal 
equations is written in the form of equation (13) 


I, 
1 01 1/R 0 O|l 1 
-11;0 1-1 V1 00|. (38) 
00:0 -1 IY, 0.1 

V3 


The introduction of a nullator between nodes 1 and 3 results in the 
addition of the corresponding columns and the equality V; = V3 = 


* Since the nullator passes zero current, the series battery in the VFE model 
may, have a nonzero resistance and still ‘maintain the proper terminal voltage. 
Thus the introduction of positive and negative resistances are unnecessary here. 
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Fig. 10 — Network after singular imbedding. 


Vi3. Thus, 
r1o0uR o]” 1 
Set a aS Sor (38b) 
oO. Tea 0.1 
Ve I 


With R = 1 ohm for ease of visualization, elementary row operations 
yield 


100! 1 A | gg] 
01 01 1}? 05 (39) 
oo Tat et) <0] 
VY; 
Thus, 
I, 0.9 1 
I,{=09]—V, 1}: (40) 
Vel 02 = | 


It is clear that Vj can take on arbitrary values while maintaining the 
constraints. We will demonstrate this for two particular values of 
Vo. For V2 =0 
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iF 0.9 

I, | _ 0.9 

Vis 0.1 

V2 0 

a Ve aed 
ia I, ~ 9? 
R, = ws — 0. 

1 


It is easily verified that these values, when substituting into the 
circuit of Fig. 1, result in V’ = V; — Ve. = 0.1 volt. 
Similarly for V2 = 0.6 volt 


Ir] 03 
I, | _ 0.8 
Vas} 202 
Vz} 0.6 


and R, = Vy, 2 - 

Again it is easily verified that V’ = V,; — V2 = 0.1 volt. With this 
simple example in mind, let us consider the solution of more com- 
plicated networks by computer. 


VII. COMPUTER SOLUTION 


A program has been written to solve the design problem for resis- 
tive networks. The program performs the following operations 

(2) Accepts input of circuit description in conversational mode. The 
circuit may contain resistors (both fixed and variable), VFE’s CFE’s 
batteries, independent current sources, and current controlled current 
sources. 

(77) Generates C, Y; , and Is matrices for the network. 

(77) Reduces equations to triangular form by a Gaussian reduction 
which pivots around largest elements in array. 

(tv) Those variables not in the basis after gaussian elimination are 
passed to the right side and stepped through specified range. Resistance 
values are printed for each setting of the free variables. Each set of 
resistance values will satisfy the given constraints. 

Four examples demonstrate the flexibility of the method. Suppose 
in the circuit of Fig. 11 one wishes to choose R, and Rez to provide 
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12V 
1260 Ro 


SET Ip =5MA 


R, 10000 


Fig. 11— Transistor design problem. 


a collector current of 5 mA. A CFE of value 0.005 is placed in series 
with the collector and the circuit of Fig. 12 is fed into the program as 
in Table I. After the program sets up the equations and performs the 
gaussian elimination, it prints, that the voltage at node 3 is free. It 
can be arbitrarily chosen to generate sets of solutions. 

This free voltage is then, at the user’s request, stepped from 7 volts 
to 10 volts in 1 volt increments. Combinations of R; and R2 which 
provide a collector current of 5 mA are printed in Table I. To verify 
these results the program DCANAL’ was used to determine the transistor 
collector current for the fifth set of resistor values. As the table shows, 
the collector current is 5 mA. 

A second example involves simultaneously constraining J, = 5 mA 
and VCE = 5 volts. As Fig. 138 shows, R;, Re, and Rz are variable. 
The network with a VFE and CFE imbedded is shown in Fig. 14, and 
the results given in Table II. Verification of the first set of resistance 
values is given. Observe that J, = 5 mA and VCE = 5 volts. 





{2V — 
{260 
200 
© 
= 
in 
a 6). 


Fig. 12 — Network after transistor modelling and singular imbedding. 
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TABLE I—PRINTOUT OF THE SESSION TO SOLVE THE CirculItT oF FicurE 11 


ae NO. OF BRANCHES, NODES,CONTROLLED SOURCES, BATTERIES, CURRENT SOURCES 
A=7 513480 

TYPE BRANCH RESISTANCES 

Brl, J. 1260, 200. 1.E£3 1.E4 1. 

TYPE FOR EACH BRANCH: INITIAL NODE,FINAL NODE,BATTERY NO. 

C-1 5 2 431 152 523 $2) 321 142 

TYPE VALUES OF BATTERIES 

D2@. 12, -.7 

TYPE FOR EACH CONTROLLED SOURCE: BRANCH NO. AND CONTROLLING BRANCH NO. 
E-6 4 

TYPE VALUES OF BETAS 

F=75, 


OPTION COMMANDS=DESIGN R 


TYPE NO, VAR. RESISTANCES,NO, VOLTAGE CONSTRAINTS, AND NO CURRENT CONSTRAINTS 
TYPE BRANCH NO. OF VARIABLE RESISTANCES 
TYPE BRANCH CURRENTS BEING CONSTRAINED 
TYPE VALUE OF EACH CURRENT BEING CONSTRAINED 
= oU 


THE FOLLOWING NODE VOLTAGES ARE FREE 
3 


ENTER LOWER LIMIT AND INCREMENT FOR EACH FREE VARIABLE AND NO, OF SETTINGS 
Oz7, 1. 4 


THE FREE VARIABLE: 7, 
RC $)21.1859722E+03 
RC 2)=9,9908005E+02 


THE FREE VARIABLE= 8. 
RC 1)=21 1B 41104E+03 
RC 2)=7,9980003 E+02 


THE FREE VARIABLE: 9, 
RC 1)21,1831496E+23 
RC 2)=5, 99800000 E+62 


THE FREE VARIABLE= 10. 
RC 1)=1,1821898E+83 
RC 2)23.9899998 E+02 


DESIGN COMMAND=KEEP 
ENTER VALUES OF FREE VARIABLES FOR DESIRED SET 
716, 


OPTION COMMAND=TRAN ALL 
VCE rc 


TRANS # 
! 4,9398562 4,9999999E-03 


12V 
Rs 
(1 SET Ig =5MA 
Ay Ve E=5V 
Ry 
Ra 


Fig. 13 — Transistor design problem. 
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® 


Fig. 14 — Network after transistor modelling and singular imbedding. 


A third example involves the rather complex three transistor circuit 
illustrated in Fig. 15. The imbedding of VFE’s and CFE’s to con- 
strain collector emitter voltages to 5 volts, and collector currents to 
10 mA is shown. 

Table IIT illustrates the results of a computer solution to the problem 
by the method of singular imbedding. Observe that currents through 
variable resistors 10 and 14 can be arbitrarily chosen and sets of resistors 
Ry. through R,, generated. Four such sets are presented in Table ITI. 
Observe the results of an analysis indicating one such set properly 
biases the network. Table IV presents the results of an optimization 
program, based on pattern search,’ to bias the network, for which 
forty-eight exploratory moves and 105 pattern moves were required. 
Each exploratory move involves between eight and 16 circuit analyses. 
Kach pattern move involves an average of four analyses. Thus, approxi- 
mately 1000 matrix inversions are required. Since each inversion involves 
(n*)/3 operations, the number of operations to generate a single bias 
network 243,000. 

Singular imbedding increases the number of nodes from 9 to 15. 
However, only one matrix inversion is required to generate a solution. 
Thus the number of operations 2n°/3 & 1125. 

Singular imbedding increases the efficiency in finding a solution to 
this problem by a factor of approximately 200. What is even more 
important is the ease with which equivalent networks are generated. 
Tach equivalent network is generated by a matrix multiplication of 
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TasBLE [I—PRINTOUT OF THE SESSION TO SOLVE THE CiRcUIT OF FIGURE 13 


Pe OF BRANCHES, NODES,CONTROLLED SOURCES,BATTERIES,CURRENT SOURCES 
$ 130 

TYPE BRANCH RESISTANCES 

Bzl. !. 1. 1268. 200. 1.E4 I. 

TYPE FOR EACH BRANCH: INITIAL NODE,FINAL NODE,BATTERY NO. 

C=1 3! 121 5 41 132 3:23 421 152 

TYPE VALUES OF BATTERIES 

D=0. 12. °-.7 

TYPE FOR EACH CONTROLLED SOURCE: BRANCH NO. AND CONTROLLING BRANCH NO, 
Es6 5 

TYPE VALUES OF BETAS 

F275. 


OPTION COMMANDS=DESIGN R 
TYPE NO. VAR. RESISTANCES,NO. VOLTAGE CONSTRAINTS, AND NO CURRENT CONSTRAINTS 
tepe SRANGH NO. OF VARIABLE RESISTANCES 
tor ahch VOLTAGE CONSTRAINT, TYPE PLUS AND MINUS NODES 
TYPE VALUE OF EACH VOLTAGE CONSTRAINT 
nes BRANCH CURRENTS BEING CONSTRAINED 
TYPE VALUE OF EACH CURRENT BEING CONSTRAINED 
ze 


THE FOLLOWING NODE VOLTAGES ARE FREE 
3 


ENTER LOWER LIMIT AND INCREMENT FOR EACH FREE VARIABLE AND NO. OF SETTINGS 
B=4_. 2. 2 


THE FREE VARIABLE= 4. 

RC 1)26.3601035E+92 

RC 2)=6 24979250 E+02 

RC 3)27.4140005E+82 

THE FREE VARIABLE=6. 

RC 1)21.2768788 E+03 

RC 2)21.0450373 E+B3 

RC 3953.41 4090! E+ 82 

DESIGN COMMAND=KEEP 

ENTER VALUES OF FREE VARIABLES FOR DESIRED SET 

4. 

OPTION COMMANDS=TRAN ALL 
VCE Ic 


TRANS # 
! 5 8000 E+99 5.8081 E-83 


the vector of free variables, which is stepped through a specified 
range, and the matrix of vectors not taken into the basis after tri- 
angulation. In this case the matrix is 19 x 2 and the vector of free 
variables is 2 X 1. Each multiplication involves 2 X 19 = 38 opera- 
tions. This means that up to 14,000 equivalent networks can be gen- 
erated with the same number of operations needed to give one solution 
by optimization techniques. 

The value of singular imbedding is apparent here. Only one equa- 
tion need be solved, and from it, all solutions are generated. 

As a fourth example, a network was designed for a specified 21; 
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CFE (0.01) 





-45V 


ee: 15 — Three transistor network with VFE’s and CFE’s imbedded for desired 
iasing. 


and Za; simultaneously. The circuit is given in Fig. 16. Ri, Re, and Rs 
are to be selected to give 21; = 74 and 2a; = 144. After proper imbed- 
ding of VFE’s and CFE’s the network of Fig. 17 results. Table V 
gives the results of a computer run to design the circuit. The third 
set, Rj = Re = Rz = 2 is shown to give the desired z-parameters 
through the Y — A transformation of Fig. 18. 


VIII. RESISTOR CONSTRAINTS 


In many design problems it is desirable to constrain the values that 
the parameters take to lie within certain limits. For example, in 
biasing a transistor network, although solutions in which some resis- 
tors are negative are mathematically correct, in practice such net- 
works are unacceptable. 

If the designer has a good feeling for the circuit he is working 
with, his choice of the free variables resulting from gaussian elimina- 
tion with maximum pivoting will usually yield resistors with posi- 
tive values. There are, however, instances involving multiple feed- 
back paths where intuition cannot always be relied upon. In these 
instances it is possible that the values given by the designer to the 
free variables yield negative resistances. Furthermore, it may be 
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TABLE IJJ—PriIntToutT OF THE SESSION TO SOLVE THE CIRCUIT OF 


Figure 15 
THE FOLLOWING BRANCH CURRENTS ARE FREE 
10 
14 


ENTER LOWER LIMIT AND INCREMENT FOR EACH FREE VARIABLE AND NO, OF SETTINGS 
Os$.E+3 1.E*3 L.E-2 .2E-2 2 


THE FREE VARIABLES ARE 
1 B60 E-83 1.608E-02 
RC1G)=24,.2366667E+03 
RCL) =1.1628088E+09 
R(12)=3,.7088484E+83 
R(13)=4.0579244E+03 
R(14)=4,1685477E+03 
RC16) 232378445 4E+93 
RC17)23.7811070E+O3 
R(18) 24.191 7495E+03 


THE FREE VARIABLES ARE 
2 O20 E-03 1.808E-82. 
RC18)22.1183333 3 
RCLL=SLIST2TICGE+89 
RC12)=3.9956586E+03 
R13) =23.7284589E+83 
R(14)24, 1685477 E+03 
RC16)=3.3704454E+03 
R(17)23.7811B70E+83 
RC18) 24.191 7495 E+03 


THE FREE VARIABLES ARE 
1. 008E-03 1.200E-02 
RC(10) =4,2366667E+03 
R(11)22.1183293 E403 
R(12)=3,7008484E+03 
"RC13)=4.9290368E+03 
R614) 23.473 7897E+63 
R16) =3,3704454E+03 
RC17)=3.7811070E+03 
RC18) =4.191 T495E+03 


THE FREE VARIABLES ARE 
2.G00E=03 1.200E=-02 
R18) =2.1183333 E03 
RCL1)22.1183293E+03 
RC12)23.9956586E+83 
RC13) =4.4512615E+03 
R614) 23.473 7897E+03 
RO16) 23.3704454E+93 
R17) 23.781 1B7CE+O3S 
RC18) 24.191 7495 E+93 


DESIGN COMMAND=KEEP 
ENTER VALUES OF FREE VARIABLES FOR DESIRED SET 
22.E*3 1,E-2 : 


OPTION COMMANDS=TRAN ALL 


VCE Ic 
TRANS # 
i 5 -OBGGE+BG 1.0800E-02 
2 5 OG0GE+28 1,9008E-02 
3 5 OGBE+0B 9-9999E=83 


difficult to explore the space of the free variables looking for regions 
where all the resistors are positive. 

One possibility for finding positive resistor regions is to use an 
optimization technique in which, considering the free variables as 
adjustable parameters, the sum of the absolute magnitudes of the 
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negative resistors is reduced to a minimum. If there exist solutions 
with all resistors positive, the minimum (zero) hopefully would be 
found automatically by the optimization routine. 

This optimization is more efficient than solving the problem by 
exploring a space in which all the variable resistors are parameters 
to be adjusted. 

Although the method given has been tried with success, a superior 
method having several advantages over the one proposed is explained 
in Section IX. The method avoids some of the most important problems 
associated with nonlinear programming. 


Some of these problems are: 


(t) The routine may get trapped in local minima. 

(11) Depending on the shapes of the surfaces involved and on the 
methods used the convergence towards the minimum may be very slow. 

(wz) If the optimization is with constraints the nonlinear constraints 
are usually difficult to handle. 
If it were possible to reduce the problem to a linear programming 
problem, the following would have been gained: 

(i) If the problem has a finite minimum it will be achieved in a 


TasBLE [V—PRINTOUT OF OPTIMIZATION PROGRAM 


INITIAL BRANCH RESISTANCES 


R(10)=0.50080E+B4 
R(11)=0.5000E+04 
R(12)=8,3800E+04 
R(L3) =8.3080E+04 
R14) =8.3800E+04 
RC16) =8.3800E+B4 
R(17) =8.3000E+04 
R(18) =8.3800E+94 


EXPLORATORY MOVES 48 
PATTERN MOVES 185 
FINAL BRANCH RESISTANCES 


R(18)=28.3730E+64 
ROLL) 28,4340 E+04 
R(12) 28.3732 E84 
RC13)=8.4377E+84 
R14) =8.3795E+94 
R(16)=8.3372 E+04 
R(17)=8.3783 E+34 
R(18)=8.4197E+94 


TRANSISTOR OPERATING POINTS 


TRANS. # VCE Ic 
! 5.000E+09 1.808E-02 
2 5. 8B0E+08 1.8@38E-82 


3. 5.880E+08 - 1,.8088E-82 
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Zoy= 1/3 





Fig. 16— Z-parameter design problem. 


finite number of steps. No local minima which are not also global 
minima exist. 
(it) Algorithms exist which converge to the minimum efficiently. 
(iit) The linear constraints generally complicate the problem only 
moderately. 


In Section IX the problem of biasing transistor networks is reduced 
to a linear programming problem. 


IX. APPLIED LINEAR PROGRAMMING 


Let us start by assuming a network in which the designer knows 
the correct signs of the node voltages with respect to the datum and 
the direction of the currents in the variable resistors. Generally the 
former is an easy task since it only involves knowing the nodes with 
the lowest potential. If this node is chosen as the datum, all the node 
voltages will be positive. Knowing the correct direction of the cur- 
rent through the variable resistors requires a better understanding of 
the circuit operation. Furthermore, there may be solutions in which 
the current through some resistors may flow in either direction. For 
this reason this requirement will eventually be relaxed. 

Linear programming requires the right side vector of equation (33) 


R3 





VFE (1/3) 






Fig. 17 — Network after singular imbedding 
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TABLE V—PRINTOUT OF THE SESSION TO SOLVE THE CIRCUIT OF FIGURE 16 


TYPE NO, OF BRANCHES, NODES,CONTROLLED SOURCES,BATTERIES,CURRENT SOURCES 
Ax6 49011 

TYPE BRANCH RESISTANCES 

Bzl. Le le e66666667 66666667 «66666667 

TYPE FOR EACH BRANCH: INITIAL NODE,FINAL NODE,BATTERY NO, 

cz! 21 i4f 241 231 341 311 

TYPE VALUES OF BATTERIES 

D=8. 12. 7.7 

TYPE FOR EACH CONTROLLED SOURCEs:BRANCH NO. AND CONTROLLING BRANCH NO, 
Es6 5 

TYPE VALUES OF. BETAS 

F275. 

TYPE FOR EACH INDEPENDENT SOURCE: INITIAL NODE AND FINAL NODE 

G=t 2 

TYPE VALUE OF EACH INDEPENDENT CURRENT SOURCE 


sie 


OPTION COMMANDS=DESIGN R 

TYPE NO. VAR. RESISTANCES,NO. VOLTAGE CONSTRAINTS, AND NO CURRENT CONSTRAINTS 
I-32 8 

TYPE BRANCH NO. OF VARIABLE RESISTANCES 

Jz1 2 3 

FOR EACH VOLTAGE CONSTRAINT TYPE PLUS AND MINUS NODES 

Kz2 1 41 

TYPE VALUE OF EACH VOLTAGE CONSTRAINT 

L=.66666667 .33333333 


THE FOLLOWING BRANCH CURRENTS ARE FREE 

1 
ENTER LOWER LIMIT AND INCREMENT FOR EACH FREE VARIABLE AND NO. OF SETTINGS 
Oz-l. 633333333 3 


THE FREE VARIABLE = -1.8000 
RC 1)= B.6666667B0E+88 
RC 2) =-8.66666667 E+980 
RC 3)5-0,66666680E+05 


THE FREE VARIABLE = -0.6667E+00 
R€ 1)= 6.18080000E+01 
RC 2)=-8,.22000808E+H1 
RC 3) 2-0, 19999999E+B1 


THE FRE VARIABLE = -@,3333E+00 
RC 1)= B.1999999T E+ 1 
RC 2)= B.28008000E+B! 
RC 3)= 0.208608013E+81 





Z4y=2/3 
Zo,= 1/3 
{ 
—— { { 
(a) (b) 


Fig. 18 — Verification of computer solution. 
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to have positive entries. This may be achieved by multiplying by —1 
all those rows in equation (33) which have a negative entry in the 
right side vector and thus obtain the set of equations 











H, | H,/ I f 
—|— )-|=- (41) 
0 |H,] V. 0 
where 
H, | H, 
H =|— 
is obtained from 
—C]} Y,U,. f I; 
— and —-| from — 
0 B 0 0 





by possibly multiplying some rows by —1. 
To force all the branch voltages to be positive let us add the constraint 


—[C]V] 2 0] (42) 


where [C] is the matrix appearing in equation (12). 
Equation (41) and inequality (42) together with the condition 


2| =o] (43) 


can be looked upon as a linear programming problem in which it is 
desired to find the value of a positive vector satisfying a set of linear 
equalities and inequalities and which minimizes the linear function 
where 


z=D T (44) 


D = [0, 0, ioe , O}. 
Since the minimization of the constant zero is of no interest, all that 
is required is to obtain the feasible solutions of the linear programming 
problem.° 
Once the feasible solutions are obtained, the fact that the solution 
satisfies equation (41) guarantees that the circuit is properly biased 
while the positivity condition on the vectors I|V and —[C]V] guarantee 
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that all the variable resistors are positive, since both the currents 
and voltages across them are positive. 

To obtain the feasible solutions phase I of the two phase simplex 
method may be used.° 

Phase I of the simplex method finds the basic positive solutions of the 
system of equations 


0 | H; 0 





(45) 


1 <ot om 
ll 
| Oo | mh 


0 |—-C|—Ujw 0 
where the vector w] (which is constrained to be positive) is a slack 
vector and U is a unit matrix. 
By denoting with A the matrix on the left of equation (45), with x 
the column on the left, and with b the column on the right side, equa- 
tion (45) may be written 





Ax = Db. (46) 


Let the dimensions be: A, m X n;x,n X 1;b, m X 1. Let A and [A | b] 
have rank r. This implies equation (46) is compatible. (The case in 
which this is not true is of no interest since in such case no solution— 
whether positive or not—exists.) 

Phase I of the simplex method finds positive solutions of equation 
(46) for r of the variables x;,1 = 1, 2, --- , 7 setting the rest of the 
2j,j7 =rt+1,--:- , nto zero.* Each one of this set is a basic feasible 
solution. There may be several such sets for a given problem. The 
totality of nonnegative solutions of equation (46) is the convex hull 
of the basic solutions. By extending the simplex algorithm so that 
once a basic feasible solution is found the other basic feasible solu- 
tions are also searched for, it is possible to obtain all basic feasible 
solutions. 

Suppose z?, 2?, -:- , x? are basic feasible solutions. Then any vector 
x satisfying 
aa ze = yet + de® bee al 

Meg hegre 4 Ap 0 | (47) 


Ar tre tices HAP Hl 


is also a feasible solution. 


and 


_ *In case no nonnegative solutions to equation (46) exist, the simplex algorithm 
is able to detect it. 


304 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1969 


If zt, 27, -++ , 2? is the set of all basic feasible solutions, then all the 
solutions of equation (47) constitute the complete set of feasible solu- 
tions. 


X. RESISTORS WITH UPPER AND LOWER BOUNDS 


In the previous discussion the appearance of nonnegative resistors 
was precluded by adding inequality (42). Often it is desirable to im- 
pose lower and upper bounds for the resistors because the technology 
used to realize them requires it. For example, if tantalum thin film 
resistors are used it is desirable to restrict them to lie between 10 and 
10° ohms. 

Let the kth variable resistor be connected from node 7 to node 7. The 
value of R; is given by 


Ve Vi. 
I, 
If it is desired to have this resistor lie within 10 and 10° ohms the 
following conditions are imposed 
V; ar: V; 
I, 
V; Tay V; 
I, 
which may be rewritten (recall J; is nonnegative) 
eee ee > 
V; - V; — 102, = iL (48) 
Ve a V; ~~ 10°7, = 0 
If instead of equation (42) inequalities similar to equation (48) 
are written for all variable resistors, the resulting circuits will have 
all variable resistors within specified upper and lower bounds (except 
for the possibility I, = 0, which implies an open circuit, in which 
case the resistor disappears altogether). . 
The problem of biasing of transistor networks with positive resis- 
tors is equivalent to solving 


= = : 
ro |H,}V! 0 


D,|-c12| 2 


R, = 


>10, 1,0 


<10°, 1,0 


Rh 





| (49) 


oOlo 
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o,i-] 29 


da] 


where D, and D, are diagonal matrices whose diagonal elements contain 
the minima and maxima for the variable resistors. By adding positive 
slack vectors w, and w, , equation (49) is equivalent to 


H,| H, | 0 |0/ 1 f 


0 | H; 0/0 /;V 0 
=- (50) 
D 1 — C ams U 0 Ww, 0 


D.{—-—C{| 0 ‘ulw, 0 


where U is a unit matrix and the vector on the left is restricted to be 
nonnegative. 








XI. RELAXING SIGN CONDITIONS 


So far it has been assumed that the direction of the current flow in 
variable resistors is known beforehand. This condition may not hold 
for some cases and hence it is desirable to relax it. 

When a variable in a linear programming problem is not required 
to be positive it is customary to write it as the difference of two posi- 
tive quantities. Thus if I, and V; — V; = V are not required to be 
positive one may write 


I, = Tye = Love 
V, = Vie = View 
where J, , Ine), Vir, Viv & O. 


A current J, of variable sign may be restricted to have a magnitude no 
less than I,, 2 0 by imposing the pair of conditions* 


I, 2 Lox or —I, 2 Lox . (51) 
Likewise a branch voltage V; of variable sign across a resistor may be 


* The constraint set on the currents is not convex, therefore it is necessary to 
solve the problem twice, once with each inequality, and take the union of the 
two solutions. If n variable resistors may have currents flowing in either direc- 
tion, the solution will be the union of the solutions of 2” problems in which 
all the combinations of the inequalities are used. 
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restricted to have a magnitude no greater than V,, = 0 by imposing the 
pair of conditions 


V,.SV, and —-V,5 Vy. (52) 


If V, is the voltage across the kth resistor and J, its current, then in- 
equalities (51) and (52) insure that the magnitude of the kth resistor 
satisfies 


Vn * 


The resistor R, may be negative or positive. However, if each variable 
resistor is made of two resistors in series one of value V.,/J., and the 
second to be determined by the computer subject to equation (53), the 
series combination of the two resistors will never be negative. This 
constitutes a technique for guaranteeing positive variable resistors 
without previous knowledge of the directions of current flows.** 

The method described can also handle circuits with variable resistors 
whose values lie within upper and lower limits. If R,,,,, and R,,,,, are 
the minimum and maximum values allowed for the kth variable resistor, 
the fixed series resistor should be 


Rus = Renin ae V on/ Lox (54) 
with V../I., chosen such that 
V on / Lo = ive = Ramin . (55) 


The value of F,,,;, may be zero. Thus, a resistor may disappear as a short 
circuit. If instead of bounding the value of a resistance from above, the 
value of an admittance is bounded, a dual method may be used to 
guarantee positive resistors. 

Instead of equations (51) and (52) the following restrictions are 
imposed 


I, J ox and —TI, Ty } (56) 
Vi 2 V oz or —V, = V ok .T (57) 


IIA 
IIA 


* Both Vox and J. are variables in the linear program which will be determined 
by the simplex algorithm. The ratio is constrained by a linear inequality. Jox | Rx. | 
—Voxr < 0, where he. | is given. 

** Another approach is to reverse the reference direction of the current and 
voltage drop across each variable resistor and apply the methods of the previous 
section. If n variable resistors may have currents flowing in either direction 
it is necessary to consider 2” possibilities. 

ft See footnote to equation 51. 
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These guarantee that 


Ls 
—_ Tok 
al s (58) 
where G, = 1/R, . G, may be positive or negative if each variable 


resistor is made of two resistors in parallel, one of admittance [4./V ox 
and the second to be determined by the computer subject to equation 
(58). However, the parallel combination of the resistors will never be 
negative. 

The dual method can also handle circuits with variable resistors whose 
admittance lies within upper and lower limits G;,,,,, and G,,,;, - The value 
of G,,,,, may be zero. Thus a resistor may disappear as an open circuit. 


XII. CHOOSING TOPOLOGY BY COMPUTER 


As already pointed out, Phase I of the simplex method obtains the 
basic feasible solutions of a set of linear equations. The set of equa- 
tions may come from a set of equalities and inequalities to which 
slack variables have been added. Usually the number of variables 
(including slack variables) is greater than the number of equations 
and the system is redundant. If r is the rank of the system and n is 
the number of variables (including slack variables), at least n-r 
variables are set to zero in obtaining a basic feasible solution. Some of 
the variables set to zero may be node voltages or variable resistor 
currents. If a node voltage is set to zero, the corresponding node is 
grounded. If a variable resistor current is set to zero, the corre- 
sponding resistor disappears as an open circuit. If a slack variable 
is set to zero, the inequality constraints are met with equalities. 

For example, for equation (50) if the kth entry of w, is zero, the 
kth resistor acquires its minimum allowed value. 

One way of viewing equation (50) is to consider the columns of 
the matrix on the left as elements of a vector space and the entries 
of the column multiplying the matrix as those positive coefficients 
which synthesize the column on the right in the form of a linear com- 
bination of the columns of the matrix. A final tableau of Phase I of 
the simplex method will contain a number of independent unit col- 
umns (with all entries zero except one) equal to the rank of the mat- 
rix on the left side of equation (50). The unit columns are obtained 
by the special gaussian reduction provided by the simplex algorithm. 
Kach column corresponds to a variable in the column multiplying the 
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matrix of equation (50). Those variables whose corresponding col- 
umns are not unit columns are set to zero. 

If a set of columns corresponding to the currents through a set of 
variable resistors are linearly dependent, one or more of the currents 
will be set to zero. This implies the disappearance of a resistor as an 
open circuit. The choice of which resistors disappear is automatically 
determined with the aid of the simplex algorithm, so that the non- 
zero currents acquire positive values (if such a choice exists). If two 
columns of the matrix of equation (50), corresponding to currents 
through variable resistors, are linearly dependent it means that Kirch- 
hoff’s voltage and current law may be satisfied with one of the cur- 
rents zero, making one of the resistors unnecessary. 

The above argument provides a method for letting a computer pro- 
gram choose the topology and resistor values of a de network in 
which certain voltages and currents are imposed by CFE’s and VFE’s. 
One connects an excess of resistors between different nodes (includ- 
ing additional internal nodes if desired). By using a linear program- 
ming formulation some node voltages and variable resistor currents 
are set to zero by the computer program, thus determining a set of 
“linearly independent positive resistors” that satisfy all the circuit 
equations. 


XII. EXAMPLES 


Consider the circuit of Fig. 19(a). The equivalent circuit is shown 
in Fig. 19(b) with a VFE and CFE in place. As indicated on Fig. 
19(b) it is desired to impose on the transistor a collector current of 
5 mA and a collector-emitter voltage of 5 volts. The resistors marked 
R,, Ro and Rz are variable. 

The nodal equations for the circuit after the effect of the nullators 
introduced by the VFE’s and CFE’s are taken into consideration 
are, in matrix form 





et. Le. Q 0.005 —0.005 0. —-1 x10" 
0. 0. 1. —0.38 0.3811 —0.0011 0. 
| 0. 0. 0 0. 0. 0.00333 0 
1. 0. 0. —1. x 107% 0. 0. 1 
0. 0. O 0. —0.001 0.001 0. 
L 0. 0. O 0.375 —0.8751 0.0001 0 
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in 12V Rz=in E3i2v 


3000 





(a) CIRCUIT 


a 19 — Circuit biased with constrained singular imbedding; (a) circuit, (b) 
model. 








hi 0.0035 | 
I, —0.27 
I, 0.035 
-| V.| = 12. : (59a) 
Ms 0.005 
ye 0.2675 | 
Vs 


By multiplying the second row by —1, the entry —0.27 in the right 
side vector is made positive. (As indicated above, linear program- 
ming assumes the right side vector is nonnegative). Notice that since 
the matrix in equation (59a) is 6 X 7, we therefore generally expect 
a one parameter infinity of solutions. If the system of equations were 
solved using the simplex method (with arbitrary cost coefficients), 
solutions in which all the variables acquire non-negative values may 
be obtained. Resistors Rez and R3, which are grounded, will auto- 
matically be positive. However the voltage differences across un- 
grounded resistors may turn out to be negative, yielding negative re- 
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sistances. To assure a non-negative voltage difference across R, the 
following additional constraint will be imposed 


Vs —- V220 
which may also be written* 
V.—V; 0. (59b) 
There are two basic feasible solutions to this problem: 
I, 5.788 5.9966425 x 10° 
I, 5.78794 0. 
I; 5.060073 x 107° 5.060073 X 107° 
V.|= 6.212 , 6.212 : (60) 
V3 5.5 5.5 
V4 10.5 10.5 
V; 6.212 11.99994 


The first basic solution yields the following set of resistors 


Bie a yy, eggs 7s oo Ooms 

— Ve _ 6.212 _ 
i= I, ~ 5.78794 ~ 1.073266 ohms (61) 
R; = Le eeeereeeet cee = 1086.941 ohms. 


FR, is a short circuit. 
The second basic solution yields the set 


R, = 97008.514, hk, = ©, Rk; = 1086.941. 


Re is an open circuit. Notice also that Rg is the same for both solu- 
tions. This is expected since the voltage of node 8 is virtually fixed 
by the requirements. 

The totality of the solutions with non-negative voltage differences 
across the variable resistors may be written, according to equation (47) 


x= de + (1 — Ax” 
where x’ and x’ are the basic feasible solutions of equation (60), and 
Of), sl. 


* When the right side of an inequality is zero, it is preferable to write it as 
a <S inequality because the corresponding slack variable may be used as an 


artificial variable with savings on the size of the matrix to be manipulated. 


CIRCUIT DESIGN 311 


Choosing \ = 3 yields 

2.89402 

2.89397 
5.060073 X 10° 

x= | 6.212 

5.0 

10.5 

9.10597 


which yields the set of resistors 


R, = 0.997906, R, = 2.146532, R; = 1086941. 


A continuous set of equivalent circuits, which achieve the require- 
ments exactly and which have positive resistances, is obtained by 
varying A between 0 to 1. 
Suppose now that further considerations require that R, lie be- 
tween 1000 and 2000 ohms. By replacing (59b) by 
V; — V2 Vs; — V2 


15 Tey Be 
1, = 1000 and L, < 2000 


which may be written 
1000 I, = V; + V2 =< 0 


the resistor FR, is forced to remain between 1000 and 2000 ohms. 
When the new problem is solved the basic feasible solutions are 


(62) 


5.782218 x 10°° 2.892554 X 107° 

5.722218 X 10° 2.832554 X 107° 

5.060073 X 10° 5.060073 x 107° 
z= 6.212 ,. <2 = 6212 

5.5 5.5 

10.5 10.5 


11.99422 11.99711 
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The basic feasible solutions yield the following sets of resistors 


« :R, = 1000., R, = 1085.593, Rk; = 1086.941 
x’ :R, = 2000., R, = 2193.074, Rk, = 1086.941. 


Notice that R, acquired its allowable extreme values in each basic 
feasible solution. 

Other sets of resistances may be obtained by convex combinations 
of the two basic feasible solutions. 

As an example in which the topology of a circuit is determined by 
the computer, consider the circuit of Fig. 16 in which R;, Re, and R3 
are to be selected to give 2;; = 34 and Z., = 14. The example was 
previously solved without linear programming techniques. Several 
solutions appear in Table V. By maximizing the negatives of the cur- 
rents in the resistors, those currents which may be set to zero by tak- 
ing them out of the basis for a basic feasible solution will be con- 
verted into open circuits. After the effect of the nullators introduced 
by the VFE’s is accounted for, the matrix corresponding to the 
circuit of Fig. 16 is 5 X 6. We therefore expect a one parameter in- 
finity of solutions and two basic feasible solutions which are 


I 


ie 0.0 | 0.28174743 

I, 5.9652404 x 107° 45937138 X 107° 
a In| _ 5.965238 x 10 oe pa 00 

V.|  44.096881 43 602828 

V; 0.3333333 0.33639577 

V; 0.3333333 0.3333333 


Notice that V4 remains constant for both basic feasible solutions. This 
is expected since a VFE is connected from node 4 to node 1 (datum). 
The resistances corresponding to the basic feasible solutions are 


2 :R, = © R, = 55.879273, — Rs = 7336.429 
x’ :R, = 154.4759, R, = 72.5629, Rz, = ©, 


In both basic feasible solutions one of the resistances disappeared as 
an open circuit. This indicates that given R4, Rs and Rg with the 
values indicated in Fig. 16 the circuit is achievable with two topologies, 
each containing 5 resistors. 

Let us now make Rez a variable resistor. The nodal matrix after the 
elimination of the nullators is now 5 xX 7. Thus we expect a two 
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parameter infinity of solutions and at least 3 basic feasible solutions. 
The following sets of resistors correspond to basic feasible solutions 


(i) Rp =o, R, =, Rs = 1.33333, Ry = 0.16666667 
(ii) Rj = 0, R, 
(iii) R, = 0.88888898,  R, = 1.333338, R,=», R,= 

(iv) R, = 13333336, R,= 0, R,= 0, Ry = 0.66666651. 


These sets provide four different topologies with which given two 
of the resistors (R4 and Rs) a resistive network having 23; = %4, 221 
= 14 may be realized. 

The example illustrates how using the methods of this paper can 
solve the problem of realizing portions of a resistive matrix with cer- 
tain elements prespecified. The prespecified elements need not be 
resistors but may also include controlled sources, gyrators, ideal trans- 
formers, and so on. 

The methods discussed have been implemented on a time-shared 


0.333333, R; = 0.4444456, Re = © 


TaBLeE VI—PRINTOUT OF THE SESSION TO SOLVE THE CIRCUIT OF 
Ficure 19 


EE Nee OF BRANCHES,NODES,CONTROLLED SOURCES,BATTERIES,CURRENT SOURCES 
TYPE BRANCH RESISTANCES 

B=!l. le 1. 208. 1.E4 308 e 

TYPE FOR EACH BRANCH: INITIAL NODE, FINAL Cent Ree NO. 

C=5 2 1 2t1 311 232 431 15 3 

TYPE VALUES OF BATTERIES 

DO. -.7 12. 

TYPE FOR EACH CONTROLLED SOURCE: BRANCH NO. AND CONTROLLING BRANCH NO. 
E=6 5 

TYPE VALUES OF BETAS 

Fx=75. 


OPTION COMMANDS=DESIGN CKT 
TYPE NO VARIABLE RESISTANCES,NO. VOLTAGE CONSTRAINTS, AND NO CURRENT CONSTRAINTS 
TYPE BRANCH NO. OF VARIABLE RESISTANCES 

TYPE PLUS AND MINUS WODES FOR EACH VFE 

TYPE VALUE OF EACH VFE 


L=5. 

TYPE BRANCH CURRENT FOR EACH CFE 
M=5 

TYPE VALUE OF EACH CFE 

N=.005 

TYPE COST COEFFICIENTS 

O=l. te 1. te Ie le 

TYPE MINIMA OF VAR TABLE RESISTANCES 
P=1020, 2. OG. 

TYPE MAXIMA FOR EACH VARIABLE RESISTANCE 
Q22008. 1.E8 1.E8 


RC 1)= 9.9999995E+B82 
RC 2)= 1.0855930E+03 
RC 3)= 1.086 9498 E+O3 
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computer system. The program is conversational. A portion of a ses- 
sion in which a basic solution corresponding to the circuit of Fig. 19 
with R, constrained betwen 1000 and 2000 ohms appears in Table VI. 


XIV. CONCLUSIONS 

The method of singular imbedding has been shown to be efficient for 
solving the following problem: Given a circuit with a prespecified 
topology, some of whose elements are prespecified, find the values of 
the unspecified elements which will yield desired node-pair voltages 
or branch currents. The unspecified element values may be restricted 
to lie within given upper and lower bounds. 

By letting the upper and lower bounds become infinite and zero, 
the problem of finding the topology for the circuit may be also solved. 

The method has been implemented on a time-shared computer, 
and several examples, including some practical transistor circuits, are 
given. 

The usual approaches to the problems of this paper have been itera- 
tive analysis-optimization schemes. Singular imbedding requires, for 
a three transistor amplifier, three orders of magnitude less computa- 
tion time. This makes the method appealing for time-shared applica- 
tions. 

Two new singular network elements, the voltage forcing element 
and the current forcing element, constrain node-pair voltages and 
branch currents without otherwise affecting the circuit. Elements of 
unspecified value are modeled by branches carrying unknown cur- 
rents. 

With the aid of these elements, the problem of design is reduced to 
one of analyzing a circuit containing unknown current sources and 
nullators. If there are more free elements than requirements, the solu- 
tion space may be a linear manifold. By allowing the free circuit 
variables to take on a set of discrete values, sets of exact solutions 
to the design problem may be generated economically. 

When the unspecified elements are required to lie within upper and 
lower bounds, the problem is one of analysis with linear inequality 
constraints. This may be solved efficiently using linear programming 
techniques. 

Among the practical problems solved by singular imbedding are 
biasing a direct coupled transistor amplifier, designing midband gain 
and driving point impedance, synthesizing networks for several given 
admittance parameters, and determining circuit topology. 

Areas being investigated include using singular imbedding in the 
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synthesis of resistance networks (the synthesis of a single column of 
a specified resistance matrix has been illustrated). Synthesis of an 
entire resistance matrix results from the intersection in resistance 
space of the solution spaces for each column of the matrix. Similarly, 
by considering the intersection of solutions spaces for both a small 
signal design and a biasing design, the method may be extended to 
designing transistor circuits for desired small signal design and bias 
points simultaneously. 

Although only fixed value CFE’s and VFE’s were used in this 
paper, CFE’s and VFE’s which may take any value within a given 
range may also be used. For example, a branch current may be forced 
to be greater than 1 mA and less than 10 mA. These elements are also 
useful in insuring that models for devices stay within their valid 
limits. For example, a transistor can be constrained to remain in the 
active region, for which the linear model used is valid. 

For simplicity, only the case of linear de networks has been il- 
lustrated in this paper. However, the method has usefulness in ac 
design, combined ac and de design, and non-linear design. These topics 
will be covered elsewhere. 
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